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PREFACE 


The present volume is intended as an all-round introduction to con- 
structivism. Here constructivism is to be understood in the wide sense, and 
covers in particular Brouwer’s intuitionism, Bishop’s constructivism and 
A.A. Markov’s constructive recursive mathematics. The ending “-ism” has 
ideological overtones: “constructive mathematics is the (only) right 
mathematics”; we hasten, however, to declare that we do not subscribe to 
this ideology, and that we do not intend to present our material on such a 
basis. 

The first successful introduction to constructive mathematics, more 
specifically to intuitionistic mathematics, was Heyting’s “Intuitionism, an 
introduction”, which first appeared in 1956 and went through three edi- 
tions. At the moment of its appearance Heyting’s book was unique; now 
there is a whole collection of introductory texts and monographs dealing 
with constructive mathematics in its various forms, as well as with the 
metamathematics of constructive mathematics. Let us therefore indicate the 
principal features of the present book. 

(a) It treats constructive mathematics in its various forms, as well as the 
metamathematics of constructivism. Each can serve as useful background 
information for the other. 

(b) The treatment of constructive mathematics does not attempt to be 
systematic, only to give some illustrative examples. 

(c) We have tried to select topics we expected to retain their interest for 
some time to come; we have not tried to follow the latest fashions. 

(d) The treatment of the metamathematics is intended to be essentially 
self-contained; therefore we have included certain details, such as the 
formalization of elementary recursion theory, which are not part of stan- 
dard introductory texts on mathematical logic. We assume only some 
familiarity with classical first-order predicate logic, preferably in natural 
deduction style, such as may be obtained from many introductory texts. 
Some familiarity with elementary recursion theory, though not absolutely 
necessary, is also helpful. 
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The material suitable for a first introduction to the topic is properly 
contained in the first six chapters. More specifically, the introductory 
material consists of chapter 1, sections 2.1-6, 3.1—-5, 4.1-7 and the chapters 
5 and 6. In addition the chapters 1-4 contain more specialized sections, 
which have been placed there for systematic reasons. We have tried to avoid 
too much interdependence in the introductory material; hence there are 
several selections for an introductory course possible, for example: 

Selection 1, with emphasis on examples of constructive mathematics, 
consists of the following sections and subsections: 1.1—-3, 3.3.1-2, 3.3.6 (first 
proof), 3.4, 4.1-3, 4.5, 5.1-4, 6.1-2, 4.6.1-10, 4.6.13-14, 4.7, 6.3, 6.4.1-11, 
6.4.13, 4.4 (sketchy), 6.4.12. 

The logical aspects may be given somewhat more emphasis by also 
treating 2.1, 2.3.1-6, 2.5.1-11, and the whole section 3.3 except for the 
second proof in 3.3.6. 

Another possible extension of this selection is 5.5—7. 

Selection 2, with emphasis on logical and metamathematical aspects, 
consists in the following sections and subsections: 1.1-3, 2.1, 2.3.1-6, 
2.4.14 (optional), 2.5.1-13, 2.6 (optional), 3.1—5, 4.1-7, possibly extended 
with 5.1-4. 

Occasional reference to material outside these selections may be skipped. 
These selections can be varied in many different ways. 

The treatment of the introductory material is more leisurely and detailed 
than in subsections and later chapters of a more specialized nature; there 
we often leave the routine proofs as an exercise for the reader. 

Most of the material has been tried out in courses at Amsterdam and 
Utrecht. 

All chapters, except the last one, have a final section entitled “Notes”. 
These are reserved for credits, historical remarks, asides, suggestions for 
further reading etcetera. The historical remarks deal primarily with the 
constructivist tradition and its metamathematics. We did not attempt to 
trace systematically the history of all the topics treated, and there is no 
pretense of completeness. 

The final chapter of volume 2, the “Epilogue”, contains some observa- 
tions of a general nature prompted by the technical developments in the 
earlier chapters, as well as a brief discussion of some controversial issues. 

In order to make independent use of the first volume possible, we have 
provided a separate index, bibliography and list of symbols to volume 1. 

Due to circumstances beyond our control the burden of authorship has 
not been equally divided. Specifically, the first author produced the bulk of 
the material and the second author wrote section 1.4 and chapter 8. Chapter 
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3 is the joint responsibility of both authors. Furthermore both authors 
accept the responsibility for the contents in the sense that the material was 
the subject of mutual discussions, criticisms and emendations. 

H. Luckhardt read through an early draft of the manuscript and provided 
a long list with corrections. 

We owe a special debt to I. Moerdijk, who over the past few years rapidly 
reversed the roles in the teacher—pupil relationship. In chapters 13-15 we 
made at several places liberal use of handwritten notes prepared by him. 
André Hensbergen tested the exercises of chapters 7 and 10, Rineke 
Verbrugge assisted us with research for some of the notes. 

The job of preparing the final version of the manuscript was made 
considerably easier by Yvonne Voorn, who typed a draft of the manuscript 
on a wordprocessor. 


Muiderberg /De Meern A.S. TROELSTRA 
January 1987 D. vaN DALEN 


PRELIMINARIES 


1. Internal references. Within chapter n, “k.m” refers to the subsection 
numbered k.m in chapter n; “section k” refers to section k of chapter n; 
“section k.m” refers to section m of chapter k (k # n), and “k./.m” refers 
to subsection /.m of chapter k (k #n). “Exercise k.l.m” or “Ek./.m” 
refers to the exercise numbered k./.m at the end of chapter k. An exercise 
numbered k./.m should be regarded as belonging to section k./. 


2. Bibliographical references. are given as author’s name followed by year 
of publication, possibly followed by a letter in the case of more than one 
publication in the same year by the same author, e.g. “as shown by 
Brouwer (1923)...”, or “...in Brouwer (1923A) it was proved that...”, or 
“the bar theorem (Brouwer 1954)...”. References to works by two authors 
appear as “Kleene and Vesley (1965)”. 

In the case of three or more authors we use the abbreviation “et al.”, e.g. 
“Constable et al. (1986)”. The bibliography at the end of the book contains 
only items which are actually referred to in the text; by the completion of 
the logic bibliography Miller (1987) has relieved us of the task of providing 
something approaching a complete bibliography of constructivism. 


In the rest of this section some general notational conventions are 
brought together, to be consulted when needed. There may be local devia- 
tions from these conventions. At the end of the book there is an index of 
notations of more than purely local use. 


3. Definitions. = indicates literal identity, modulo renaming bound vari- 
ables. = is used as the definition symbol, the defining expression appears 
on the right hand side. 


4, Variables, substitution. The concept of free and bound variable is defined 
as usual; for the sets of free and bound variables of the expression a we use 
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FV(a) and BV(a) respectively. As a rule we regard expressions which differ 
only in the names of bound variables as isomorphic; that is to say, bound 
variables are used as position markers only. 

For the result of simultaneous substitution of t,,...,t, for the variables 
X,,...,X, in the expression a we write a[x,,...,x,/t),..-,l,]. We shall 
often use a looser notation: once a(x,,..., x,,) has appeared in a context, 
we write a(t),...,¢,) for a[x,,...,x,/t,..., ¢,,]. Using vector notation as 
an abbreviation we can also write a[X/f] for a[x,,...,X,/ty.--+5 ty). 

In using the substitution notation we shall as a rule tacitly assume the 
terms to be free for the variables in the expression considered (or we 
assume that a suitable renaming of bound variables is carried out). In our 
description of logic in chapter 2 we are more explicit about these matters. 

We shall frequently economize on parentheses by writing Ax or Al (A a 
formula) instead of A(x) or A(t). 


5. Logical symbols. As logical symbols we use 
ae oma) V, A,> ce ,V,4, 2! 


both formally and informally. In bracketing we adopt the usual convention 
that —,V,4 bind stronger than any of the binary operators, and that A, V 
bind stronger than —, ©. Occasionally dots are used as separating 
symbols instead of parentheses. In discussing formal systems it will be 
usually clear from the context whether the symbol is used as part of the 
formalism or on the metalevel. Where it is necessary to avoid confusion we 
use =, = on the metalevel; sometimes a comma serves as a conjunction, 
and “iff” abbreviates “if and only if’. Unless stated otherwise, 4, @ ,3! 
are regarded as abbreviations defined by 


AA=A>L; 
AoB:=(A>B)A(B-A); 
3ixA = 4x(A AVy(x=y © A[x/y])). 


For repeated equivalences we sometimes write A 9 B@ Ce --- mean- 
ing (A @ B)A(BeC)A--- . For iterated finite conjunctions and dis- 
junctions we use 

A, V. 


Notation for restricted quantifiers: 


Vxe X,4dx eX. 
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For iterated quantifiers we use the abbreviations 


VR Ny hay VK Gs VG AX Koi eX, SIN IN, SX, 


and for iterated restricted quantifiers 
Vx,y © A = Vx GC AVy © A etcetera. 


For possibly undefined expressions a, Ea means “a exists”, or “a is 
well-defined” (cf. the use of ¢ | in recursion theory). 


6. Set-theoretic notation. Our standard set-theoretic symbols are 
Ge, €,C,2,\, Nn, UA, U. 


Here C and 2 are used for not necessarily proper inclusion, i.e. X C Y 
= Vx(x © X > x € Y), etc. We also use the standard notations 


{x,,X,...,X,} (for finite sets), 
{x: A(x)}, (f(x): A(x)}, (x © B: A(x)} (fa function). 


For complements of sets relative to some fixed set we often use °. The 
fixed superset involved will be clear from the context. Thus if X is a subset 
of N, we write X° for N \_X. 

For finite cartesian products we use x, for arbitrary cartesian products I]. 

If X isa set and ~ an equivalence relation on X we write X/~ for the 
set of equivalence classes of X modulo ~. For the equivalence class of an 
x © X we write x/~, x_,(x)_ or[x]_. 

The set of functions from X to Y is written as X > Y or Y*. Restrictions 
are indicated by [ or |. P(X) is the powerset of X. 

“f is a function from X to Y” is written as fe X — Y (and only 
occasionally as f: X > Y); the use of this notation must not be regarded 
as a commitment to the set of all functions from X to Y as a well-defined 
totality. If ¢ is a term we can also introduce “t regarded as a function of the 
parameter (variable) x” by one of the following notations: 


Ax.t or fixet. 


Notations (in diagrams) for injections, surjections, bijections and embed- 
dings are >, >, »», > respectively. 

For the characteristic function of a relation R we use x p, where x p(t) = 0 
© R(t). 
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For the function f applied to the argument f we usually write f(t), or 
even ft, if no confusion can arise; in general we drop parentheses whenever 
we can safely do so. In certain chapters we use t(t’) or tt’ for term ¢ applied 
to term ?’ (e.g. in chapter 9); in such cases we use square brackets instead of 
parentheses to refer to occurrences in a term, writing ¢[x] instead of t(x) 
etc. 

Pairs, or n-tuples for fixed length, are usually also indicated by means 
of parentheses ( , ); in the case of finite sequences of variable length 
we use { , ): (X,,X,---5,X%,) IS a sequence of length n (notation 
Ith(x,, x,...,X,) =n). For concatenation we use *. A sequence of 
arguments may also be indicated by vector notation; thus we write f(7) or 
ft for f(t,,...,t,), where T= (t,,...,1,)3 i = 5 indicates equality of vec- 
tors. 

We use the abbreviation Ax,x,...x,.f for Ax\(Ax,...(Ax,.t)...). If 
we wish to regard ¢ as a function in n arguments x,, x,,...,X, We use 
comma’s: AX, Xz,...,X, 1. 

For the graph of the function f we write graph( f), dom( /) is its domain, 
range(f) its range. 


7. Mathematical constants. The following constants 
N,Z,Q,R,B,C 


denote the natural numbers, the integers, the rationals, the reals, Baire 
space and the complex numbers respectively. Throughout the book, 
n,m, i, j, k, unless indicated otherwise, are supposed to range over N, and 
a, 8, y, 6 over B or a subtree of B. In metamathematical work we use 7, m 
for numerals. 

For an infinite sequence ay, a,, 0,... (i.e. a function with domain N) 
we use the notation (a,),. A notation such as lim(x,,), is self-explanatory. 

The notation for arithmetical operations 1s standard; the multiplication 
dot - is often omitted. 


8. Formal systems and axioms, Formal systems are designated by combina- 
tions of roman boldface capitals, e.g. HA, IQC, EM /, etc. 

If H is a system based on intuitionistic logic, H° is used for the 
corresponding system based on classical logic. 

For the language of a formal system H we often write “(H). If H’ 
extends H, and #(H’) extends £(H), while H’ nN #(H) = H, then H’ is 
said to be a conservative extension of H (H’ is conservative over H). A 
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definitional extension is a special case of a conservative extension, where the 
extra symbols of £(H’) can be replaced by explicit definitions (cf. 2.7.1). 
Axiom schemas and rules are usually designated by combinations of 
roman capitals: REFL, BI, BI,, FAN, WC-N, CT, etc. 
Ht A means “A is derivable in the system H”, and H+ XYZ+ A 
‘*A is derivable in H with the axiom schema or rule XYZ added”. Occasion- 
ally we use subscript notation: 4, A instead of Hf A. 


9. Validity. For validity of a sentence A in a model # we use the notation 
M = A. It is to be noted that if the relations, functions and constants of the 
model .# are defined constructively, then . A also makes sense con- 
structively (cf. 2.5.1). 
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CHAPTER 1 


INTRODUCTION 


In this chapter we first present a brief characterization of the various 
forms of constructivism which shall play a role in the book later on. After 
the review of constructivist trends, we illustrate the idea of constructivity by 
means of some simple and quite elementary examples in section 2; most of 
the points discussed will receive a more thorough treatment in later chapters. 

Section 3 introduces the Brouwer—Heyting—Kolmogorov interpretation 
of the logical operators, and the method of weak counterexamples. 

Section 4 gives a brief historical survey. 


1. Constructivism 


1.1. Time and again, over the last hundred years certain mathematicians 
have defended an approach to mathematics which might be called “con- 
structive” in the broad sense used in this book, in more or less explicit 
opposition to certain forms of mathematical reasoning used by the majority 
of their colleagues. Some of these critics of the mathematics of their (our) 
time not only criticized contemporary mathematical practice, but actually 
endeavoured to show how mathematics could be rebuilt on constructivist 
principles. 

There are, however, considerable differences in outlook between the 
various representatives of constructivism; constructivism in the broad sense 
is by no means homogeneous, and even the views expressed by different 
representatives of one “school”, or by a single mathematician at different 
times are not always homogeneous. Our descriptions below present a 
simplified picture of this complex reality. We shall attempt a brief char- 
acterization of the principal constructivist “trends” or “schools” playing a 
role in this book. 

In our discussions we shall use the adjective “classical” for logical and 
mathematical reasoning based on the usual two-valued logic, in which every 
meaningful statement is assumed to be either true or false. 
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1.2. Finitism. The principal tenets of finitism are: 

(a) only concretely (finitely) representable structures are objects of 
mathematics; operations on such structures are to be combinatorial in 
nature and hence, of course, effective. 

(b) abstract notions such as (arbitrary) set, operation, construction etc. 
have no place in finitist mathematics. 

Kronecker (1823-1891), who with some justification may be regarded as 
“the first constructivist”, may perhaps more specifically be regarded as a 
precursor of the finitist approach. Finitism is characteristically represented 
by Skolem in his paper (1923) and in two books by Goodstein (1957, 1961). 

Finitism also plays a role in Hilbert’s programme: for methods regarded 
by all mathematicians as incontrovertible, to be used in a proof of freedom 
of contradiction of (the bulk of) existing mathematics, Hilbert wanted to 
draw on finitist mathematics; this of course does not mean that Hilbert is to 
be regarded as a finitist. 

A formal system which is regarded as codifying a typical part of finitist 
mathematics is PRA, the system of primitive recursive arithmetic, to be 
discussed briefly in section 3.2. 


1.3. Predicativism. This may be regarded as “constructivism with respect to 
definitions”. In predicativism: 

(a) Definitions of mathematical objects should be predicative, that is to 
say it is not permissible to define an object d by referring to a collection D 
of which the object d is to be an element; in particular, quantification over 
D in defining d is not permitted (avoidance of a “vicious circle” in 
definitions). 

(b) The demand for predicative definitions is regarded as compatible with 
traditional logic, that is to say mathematical statements are assumed to be 
true or false — regardless of human knowledge (this in contrast to intuition- 
ism, described in the next subsection). 

As a rule the set of natural numbers N is regarded as unproblematic from 
a predicativist point of view. This means that one accepts quantification 
over N and thus arithmetically defined predicates as predicatively meaning- 
ful. 

Having thus grasped the idea of an arithmetically defined set or predicate 
(sets of level 1), we may quantify over such sets in constructing predica- 
tively defined sets of level 2, etcetera. 

On the other hand, defining the least upper bound of a set X of reals 
(defined as Dedekind (left) cuts) as the intersection of all left cuts which are 
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upper bounds for X is not permissible: the least upper bound itself occurs 
among the collection of upper bounds. 

The first to show that parts of analysis could be developed predicatively 
was Weyl in his monograph “Das Kontinuum” (1918). More recently 
Lorenzen in his books (1955, 1965) has given a predicative development of 
analysis. Lorenzen (1955), however, does not accept classical logic outright, 
but justifies its use relative to constructive logic. 

Poincaré and the French “empiricists” such as Borel may be regarded as 
precursors of predicativism, but they did not develop parts of mathematics 
predicatively in a systematic way (cf. also 4.2). 

Predicativism as such does not play a role in this book, but the demands 
of predicativism may or may not be combined with the demands of 
intuitionism and Bishop constructivism and thus the issue of predicativity 
occasionally crops up in other constructivist trends as well. 


1.4. Bishop’s Constructive Mathematics (“BCM”). This approach is typi- 
cally represented by Bishop’s book “Foundations of constructive analysis” 
(1967). In the work of this school there are few discussions of philosophical 
principles, the emphasis is entirely on the practice of constructive mathe- 
matics, instead of on its philosophy. 

(a) The key phrase is that “mathematical statements should have numeri- 
cal meaning”. This means, in particular, that existential statements must, in 
principle, be capable of being made explicit (and also, in asserting A V B, 
the choice between A or B must be possible in principle): one can only 
show that an object exists by giving a finite routine for finding it. 

(b) In BCM it is not assumed that all mathematical objects are to be 
given in the form of an algorithm (in the mathematically precise sense of 
recursive function theory). Choice sequences (cf. 1.6 below) are not ad- 
mitted as legitimate objects. Bishop does not hesitate to accept abstract 
concepts such as “rule” or “operation”. 

In BCM no definite position is taken with respect to the admissibility of 
impredicative definitions; in practice, generalized inductive definitions are 
admitted (Bishop 1967, p. 68); for a discussion of this type of definition see 
section 4.8. 


1.5. Constructive Recursive Mathematics (“CRM”). This is a form of con- 
structivism developed by Markov (1903-1979) from ca. 1950 onwards. In 
the sense we understand it here it is typically represented by the writings of 
Markov, Shanin and their students in the period 1950-1967. 
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A feature CRM has in common with finitism is that mathematical objects 
should be finitely representable. More specifically: 

(a) The objects of mathematics are algorithms, understood in a mathe- 
matically precise sense: algorithms can be presented as “words” in some 
finite alphabet of symbols. 

(b) Limitations due to finite memory capacity are disregarded, the length 
of symbol strings is unbounded (though always finite). 

(c) Logically compound statements not involving 4, V are understood in 
a direct way, but existential statements and disjunctions always have to be 
made explicit. 

(d) If it is impossible that an algorithmic computation does not terminate, 
we may assume that it does terminate (“‘Markov’s principle”). 


1.6. Intuitionism. Intuitionism will be understood here as the constructive 
approach to mathematics in the spirit of Brouwer (1881-1966) and Heyting 
(1898-1980). The philosophical basis of this approach is already present in 
Brouwer’s thesis (1907) but with respect to the mathematical consequences 
Brouwer (1918) is a more appropriate starting point. The basic tenets may 
be summarized as follows. 

(a) Mathematics deals with mental constructions, which are immediately 
grasped by the mind; mathematics does not consist in the formal manipula- 
tion of symbols, and the use of mathematical language is a secondary 
phenomenon, induced by our limitations (when compared with an ideal 
mathematician with unlimited memory and perfect recall), and the wish to 
communicate our mathematical constructions to others. 

(b) It does not make sense to think of truth or falsity of a mathematical 
statement independently of our knowledge concerning the statement. A 
statement is true if we have proof of it, and false if we can show that the 
assumption that there is a proof for the statement leads to a contradiction. 
For an arbitrary statement we can therefore not assert that it is either true 
or false. 

(c) Mathematics is a free creation: it is not a matter of mentally 
reconstructing, or grasping the truth about mathematical objects existing 
independently of us (this is in contrast to e.g. the French empiricists; cf. 
4.2). 

It follows from (b) that it is necessary to adopt a different interpretation 
of statements of the form “there exists an x such that A(x) holds” and “A 
or B holds”. In particular, “A or not A” does not generally hold on the 
intuitionistic reading of “or” and “not”; we return to this in section 3. 
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In agreement with, but not necessarily following from (c), intuitionism 
permits consideration of unfinishable processes: the ideal mathematician 
may construct longer and longer initial segments a(Q),...,a(m) of an 
infinite sequence of natural numbers a where a is not a priori determined 
by some fixed process of producing the values, so the construction of a@ is 
never finished: «a is an example of a choice sequence. We return to this topic 
in section 4.6. 

Unfinishable objects such as choice sequences clearly do not fit into the 
framework of CRM, the viewpoint described in the preceding subsection. 

On the other hand, “Markov’s principle”, mentioned under (d) in 1.5, is 
not considered as intuitionistically valid. In fact, it is difficult to decide the 
validity of Markov’s principle by informal analysis alone. We shall return to 
Markov’s principle in section 4.5. 


1.7. In this book we shall have frequent occasion to refer to intuitionism, 
CRM and BCM in particular. However, the present book is not written 
from the viewpoint of any particular school or trend mentioned above, 
inasmuch we shall not accept any of these views as normative for mathe- 
matics. 

We do present constructive mathematics (in the wide sense) as a legiti- 
mate part of mathematics, containing material which is mathematically 
interesting, regardless of any philosophical bias. 

Constructive mathematics, and especially its logic and its metamathe- 
matics are of considerable interest for the philosophy of mathematics. 
There are some indications that intuitionistic logic can be useful in other 
parts of mathematics too, such as topos theory (see e.g. Johnstone 1977, 
1982), or theoretical computer science (see for example Constable et al. 
1986). 


2. Constructivity 


2.1. In this section we shall attempt to illustrate the idea of constructivity 
by means of some quite elementary examples. Most of the points discussed 
will receive a more thorough treatment in later chapters. 

Which objects can be said to exist as (mental) constructions? Natural 
numbers are usually regarded as unproblematic from a constructive point 
of view; they correspond to very simple mental constructions: start thinking 
of an abstract unit, think of another unit distinct from the first one and 
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consider the combination (“think them together’). The indefinite repetition 
of this process generates the collection N of natural numbers. 

It should be pointed out that already here an element of idealization 
enters. We regard 5, 1000 and 10!” as objects of “the same sort” though 
our mental picture in each of these cases is different: we can grasp “five” 
immediately as a collection of units, while on the other hand 10!” can only 
be handled via the notion of exponentiation; 1000 represents an inter- 
mediate case. Visualizing 10!” asa sequence of units is out of the question. 
Exponentiation as an always performable operation on the natural numbers 
involves a more abstract idea than is given with the generation of N. 

Certain mathematicians, such as A.S. Esenin-Vol’pin have tried to dis- 
tinguish between “reasonably” small and “unreasonably” large numbers, 
and between different “natural number structures” according to the princi- 
ples needed to generate them. Permitting exponentiation may result in a 
bigger number sequence with a more complex structure, than in the case 
where exponentiation is not considered, although addition and multiplica- 
tion are accepted. This approach might be described as ultra-finitism. The 
name “ultra-intuitionism” has sometimes been used, but seems less ap- 
propriate. There are considerable obstacles to be overcome for a coherent 
and systematic development of ultra-finitism, and in our opinion no satis- 
factory development exists at present. In this volume we shall therefore 
refrain from any further discussion of ultra-finitism except for some re- 
marks in 4.8, 

Thus all the constructivist schools described in section 1 contain elements 
of idealization, that is to say their description of constructive mathematical 
principles contains “theoretical” elements. In particular, we regard 
Brouwer’s intuitionism and Markov’s constructivism as “theories about 
mathematics”. It is also for this reason that one cannot identify intuition- 
ism with a psychologistic approach to mathematics. 


2.2. A non-constructive definition. Let A be any mathematical statement 
which at present has been neither proved nor refuted (e.g., A =“there are 
infinitely many twin primes”). Ordinarily one permits in mathematics 
descriptions (definitions) of natural numbers such as the following 


= t if A holds, 
P 2 otherwise. 


Constructively this is unacceptable as the description of a natural number 
since, as long as the truth of A has not been decided, we cannot identify p 
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with either 1 or 2; in other words we do not know how to obtain p by 
“thinking some abstract units together”, since we do not know whether to 
stop at one unit or to go on till we have two distinct units. 


2.3. Once having accepted the natural numbers, there is also no objection 
to accepting pairs of natural numbers, pairs of pairs etc. as constructive 
objects; and this permits us to define Z (the integers) and Q (the rationals) 
in the usual way as pairs of natural numbers and pairs of integers 
respectively, each modulo suitable equivalence relations. 

Infinite sequences of natural numbers, integers or rationals may be 
constructively given by some process enabling us to determine the nth term 
(value for n) for each natural number n; in particular, one may think of 
sequences given by a /aw (or recipe) for determining all its values (terms). 

Thus a real number can be specified by a fundamental sequence of 
rationals together with a Cauchy modulus, that is to say we specify a 
sequence (7,,),, & N > Q and a sequence a € N > N (the modulus) such 
that for all k € N and all m, m’ > ak 


Tn = Tn | s 2F, 


Two such sequences (7,,),, (5,), are Said to be equivalent (equal as real 
numbers) if for all k © N there is an n € N such that for all m 


ia onal pay 


Much of the elementary theory of real numbers can then be developed in 
exactly the same way as usual; we shall do this in chapter 5. 

We now give an often quoted example of a non-constructive proof for the 
following 


PROPOSITION. There exist two irrational real-numbers a, b such that a? is 
rational. 


Proor. (/2)”” is either rational, and then we can take a = b = /2, or (/2)"” 
is irrational, and then we can take a = (/2)"*, b= 2. O 


This proof does not enable us to construct a as a real number, that is to 
say we cannot compute a with any desired degree of accuracy, as required 
by our description of what it means to be given a real number. In other 
words, we do not know how to find an arbitrarily close rational approxima- 
tion. Note however, that our objections to the proof depend on reading 
“there exist” as “one can construct”. 
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A constructive proof of the proposition above is possible, e.g. by an 
appeal to Gelfond’s theorem: if a € {0,1}, a algebraic, b irrational alge- 
braic, then a? is irrational (even transcendental). Cf. e.g. Gelfond (1960, p. 
106, theorem 2). 

As a second example of a non-constructive argument, we consider the 
classical proof of Kénig’s lemma. 


2.4, PROPOSITION (KGnig’s lemma). Let T be an infinite, finitely branching 
tree, then T has an infinite branch. 


Proof. We “construct” the infinite branch a € N — T as follows. For a0 
we take the root of 7. Suppose an to have been constructed such that an 
has infinitely many successors. Among the finite set of immediate successors 
to,-.-,¢, Of an there is at least one ¢; such that ¢; has infinitely many 
successors; take a(n + 1) = ¢,. Clearly we can continue this process inde- 
finitely and an infinite branch is generated. O 


Given some ordering of the nodes, we can make the description of @ in 
the proof more definite by taking for a(n + 1) the first ¢,; having infinitely 
many successors. Nevertheless, this “construction” of a is not constructive, 
since we do not know, in general, how to decide whether a finite sequence 
in T has infinitely many successors or not; so the proof above does not give 
us a recipe for producing an for each n. 

The preceding examples of non-constructive proofs show that the natural 
constructive reading of “A or B” and “4xA(x)” as “we can decide between 
A and B” and “we can construct an x such that A(x)” respectively does 
not agree with all the laws of classical logic. In particular, “A or 4A” does 
not hold if we read “or” as above: we have no reason to assume that we can 
always decide, for any assertion A, whether A is true or refutable. 

In actual mathematical practice, one is often interested in constructiviz- 
ing “locally”, e.g. making existential statements of a specific theorem 
explicit by estimating bounds etc. In contrast, we shall mostly investigate 
globally constructive theories, where our logic can consistently be interpre- 
ted as permitting explicit realizations of all existential statements, and all 
theories should be read with a strong constructive meaning of J and Vv. 


3. Weak counterexamples 


In 1908 Brouwer introduced for the first time “weak counterexamples”, for 
the purpose of showing that certain classically acceptable statements are 
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constructively unacceptable (Brouwer 1908). Too much emphasis on these 
examples has sometimes created the false impression that refuting claims of 
classical mathematics is the principal aim of intuitionism. 

Nevertheless, the weak counterexamples are pedagogically useful and 
may serve to motivate e.g. Kripke semantics, which we shall introduce in 
the next chapter. Before we can explain these counterexamples we must 
explain in a more systematic way the constructive interpretation of the 
logical operators. 

In discussing pure logic we shall treat “constructive” and “intuitionistic” 
as synonymous. 


3.1. Logical operations; the BHK-interpretation. In the actual building of 
constructive mathematics we do not need logic; nevertheless we find it 
convenient to use logical symbolism. We shall explain the use of the logical 
operations in a constructive context by the following stipulations (going 
back to Heyting 1934), which tell us what forms proofs of logically 
compound statements take in terms of the proofs of the constituents. 


(H1) A proof of A A B is given by presenting a proof of A and a proof 
of B. 

(H2) A proof of A V B is given by presenting either a proof of A or a 
proof of B (plus the stipulation that we want to regard the proof 
presented as evidence for A V B). 

(H3) A proof of A > B is a construction which permits us to transform 
any proof of A into a proof of B. 

(H4) Absurdity 1 (contradiction) has no proof; a proof of AA is a 
construction which transforms any hypothetical proof of A into a 
proof of a contradiction. 

(H5) A proof of VxA(x) is a construction anich transforms a proof of 
d © D (D the intended range of the variable x) into a proof of A(d). 

(H6) A proof of 3xA(x) is given by providing d € D, and a proof of A(d). 


This explanation is quite informal and rests itself on our understanding of 
the notion of construction and, implicitly, the notion of mapping; it is not 
hard to show that, on a very “classical” interpretation of construction and 
mapping, H1-6 justify the principles of two-valued (classical) logic (cf. 
E1.3.4). In clause H4 the notion of a contradiction is to be regarded as a 
primitive (unexplained) notion. 

If, in the clauses H5, H6 the domain D is “simple” enough, it may 
happen that any d in D so to speak represents its own proof of belonging 
to D (d is presented to us as an object of D). N is an example: a natural 
number is given to us as such, we do not need a separate proof of this fact. 
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For such simple domains, the reference to “a proof of d © D” in H5 and 
H6 may be dropped. The clauses H5 and H6 then read 


(H5’) A proof of VxA(x) is a construction which transforms any d € D 
into a proof of A(d). 

(H6’) A proof of 3xA(x) is given by presenting a d € D and a proof of 
A(d). 


In the sequel we shall refer to the clauses H1-6 above as the 
“Brouwer~Heyting—Kolmogorov interpretation” (BHK-interpretation, see 
5.3). 


3.2. Even if the explanations H1-6 leave a lot of questions open, they 
suffice to show that certain logical principles should be generally acceptable 
from a constructive point of view, while some other principles from 
classical logic are not acceptable. 

As a positive example, consider the statement A — (B — A), for arbi- 
trary mathematical statements A, B. There is a very general proof, obtained 
as follows. Suppose a proves A. Then Ab.a, the constant function which 
assigns a to each argument, transforms a hypothetical proof of B always 
into a proof of A; so Ab.a proves B > A. Therefore if a proves A, then 
\b.a proves B — A; and so the mapping Aa.[Ab.a], which assigns to a the 
constant mapping Ab.a, transforms a proof of A into a proof of B > 4; 
hence Aa.[Ab.a] is a proof of A > (B > A). 

Another consequence is that .L — A is generally provable: since there is 
no proof of 1, Aa.a (or any other mapping) may count as a proof of 
4 — A, since it has to be applied to an empty domain. The principle 
1—A (“ex falso sequitur quodlibet”) has sometimes been rejected as 
non-constructive (Johannson 1937); Heyting (1956, 7.1.3) regards it as an 
extra stipulation fixing the meaning and use of 1, >. 

The rejection of 1 — A means that one regards 1 as a proposition not 
known to be provable (but which may turn out to be provable, if our 
mathematical world is inconsistent). This position leads to minimal logic; 
the difference with intuitionistic logic based on H1-6 is slight (cf. 10.5.2). 

On the negative side, consider the Principle of the Excluded Middle 
(PEM, or “‘tertium non datur’’) 


PEM AV WA, 


which is generally valid in classical logic. Constructively, accepting PEM as 
a general principle means that we have a universal method for obtaining, 
for any proposition A, either a proof of A or a proof of —A, i.e. a method 
for obtaining a contradiction from a hypothetical proof of A. 
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But if such a universal method were available, we could also decide a 
statement A the truth of which has not been decided yet (e.g. the example 
of the infinitely many twin primes in subsection 2.2), which is not the case. 
Thus we cannot accept PEM as a universally valid principle on the 
BHK-interpretation. 

The preceding argument is typically a “weak counterexample”: PEM is 
not refuted, that is to say, we have not shown how to derive an actual 
contradiction from the assumption that PEM is valid. Instead, we have only 
shown PEM to be unacceptable as a BHK-valid principle of reasoning, 
since accepting it means that we ought to have certain knowledge (i.e. a 
decision as to the truth of A) which in fact we do not possess. One might 
also say that the “weak counterexample” makes the validity of PEM for the 
BHK-interpretation highly implausible. 


3.3. In fact, we cannot hope to refute any individual instance of PEM, that 
is to say we cannot find a mathematical statement A such that 4(A V — A). 
This is impossible, since —4=(A V 4A) holds universally: it is an intuition- 
istic logical law. This may be seen on the basis of the BHK-interpretation as 
follows. Suppose 


c establishes (4 V —A). (1) 
Hence, 

if d establishes A V —A, c(d) proves 1. 
Clearly, there are operations e+ ey, f+ f, such that 


e establishes A = e, establishes A V “A, 


f establishes ,A = f, establishes 4 V 4A. 


Therefore if (1) holds, g: e > c(eg) (ie. g = Ae.c(eg)) is a construction 
for A>1L=—A and h:fc(f,) (or hi=Afi.c(f,)) for nA PL. 
Therefore h(g) is a construction for 1 , so the mapping c ~ h(g) estab- 
lishes —=(A V —A). 


3.4. In the chapters dealing with examples from constructive mathematics 
we shall often appeal to a number of simple logical laws valid on the 
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BHK-interpretation; in particular the following are frequently encountered: 
A> A; 
3A @ —W4; 
(A V B) @ AAA -B; 
(A A B) @(A>-=B)e(B>—A); 
(A > B) © (44 ~ 4B) © (A > 4B) 0 (54 V B); 
(A > B) @ =(A4 V B); 
ifA—> B, then 4“B>-A, 
and with quantifiers 
43xA(x) @ VxiA(x); 
a-VxA(x) > Wx=A4(x); 
=3xA(x) @ AVxA(x). 


We leave it as an exercise to verify that these laws are valid on the 
BHK-interpretation. 

On the other hand, besides PEM, the following are examples of principles 
which are valid in classical logic but not on the BHK-interpretation: 


a4 > A; 

AAV A; 

(A > B)v (B> A); 

(a4 AaB) >A VB: 

LCAVGE) ARE 

VxA(x) ~ =AVxA(x); 

374.xA(x) > 3x-4A4(x); 

(A > 3xB(x)) ~ dx(A > B(x)) (x € FV(A)). 


We shall encounter methods to demonstrate their non-validity in later 
chapters (sections 2.5, 4.3, 4.6). 

In the standard formalism, IQC, for intuitionistic predicate logic, dis- 
cussed in the following chapter, terms are supposed to be always defined, 
that is to say terms always denote something, just as in the usual for- 
malisms for classical predicate logic. However, in intuitionistic mathematics 
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possibly undefined terms or expressions which are well-defined only for 
certain values of the parameters, crop up naturally (e.g. x~} for reals x); 
while the classical way out, namely to guarantee that such expressions 
always denote something by means of arbitrary stipulations, is not available 
here (more about this in 5.3.11). Thus some care is needed in handling such 
expressions; if we use Er for “t is well-defined” (corresponding to t | in 
recursion theory), we have in particular 


WxA(x) A Et > A(t), , 
A(t) A Et > 3xA(x). 0) 


Replacing the usual principles VxA(x) — A(t) and A(t) — 3xA(x) by the 
more cautious (1) corresponds on the formal level to the E*-logic of 2.2.3. 
In handling possibly undefined expressions in our treatment of parts of 
constructive mathematics in chapters 5—8 we shall assume (1). 


3.5. Undecidability of equality for the reals. A more sophisticated weak 
counterexample is the following. Let A(n) be a predicate of natural 
numbers, such that A(n) is decidable, but the truth of VnA(n) is unknown, 
i.e. 


Vn(A(n) V =A(n)), 


1 
2WnA(n) V ~WnA(n). 0) 
At the time of writing this section we might take “Fermat’s last theorem” 
as an example of such A, putting A(n) = Vm,m,m,k(m, + m,+m,+k 
<n—>(m, + 1)k*3 + (m, + 1)*3 + (m, + 1)**3), where m,, mz, m3, 
k, n range over N. 
Suppose now that we define a real number x 
sequence of rationals (7,4), defined as follows: 


4 via a fundamental 


n 


1 [27" ifvk <nA(k), 
res 
2-* if A(k) Ak <n A Vk! < kA(k’). 


It is easy to see that this is indeed a fundamental sequence, since 
Vn > m(|r4 — r4] < 27™); also 


x4=0 6 WnA(n). 


The direction from right to left is obvious; conversely, if x4 = 0, then 
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ra, < 27” necessarily follows for all n, hence A(n) for all n. Therefore 


x4=O0Vxt#0 


is equivalent to VnA(n) V =VnA(n). Thus we have a weak counterexample 
to the decidability of equality between reals; this example, in its turn, can 
serve for the construction of weak counterexamples to many other state- 
ments, as we shall see below. 


3.6. Let us call a set finite iff it is in 1-1 correspondence with an initial 
segment of N, and consider for example the following subset of the reals: 


X= {0, x4}. 


X is the image of the finite set {0,1} (under the mapping ¢ given by 
pO = 0, pl = x“), or in other words, X is finitely indexed (namely by 
{0, 1}). However, we cannot show X to be finite, since this would require us 
to decide whether it has one or two distinct elements, and this in turn 
would require us to decide WnA(n) V —WnA(n). 

Another consequence of the weak counterexample to x = 0 V x # 0 is, 
that the usual examples of discontinuous functions from R to R fail to yield 
total functions. For example, the function f specified by 


f(x) =1 ifx#0 
f(x) =0 ifx=0 


is classically everywhere defined, but not so constructively. For if we 
suppose f to be constructively defined everywhere on R, we ought to be 
able to construct arbitrarily close approximations to f(x) for any argument 
x; but this means in particular that we should be able to decide 


f(x4) <1 Vv f(x4) > 0, 


hence 4x4 #0 V x4 #0. This. is equivalent to x4 =0 V x4 #0: for 
decidable A(n) one readily sees ~4A(n) © A(n), and since x4 =0 0 
VnA(n), a-WnA(n) > Wn—— A(n), we see that 4x4 # 0 © x4 = 0. Since 
x4 =0 Vv x4 # 0 is undecided, we also cannot assert that f is everywhere 
defined on R. 

At first sight, it seems as if the failure of decidability of equality between 
the reals must have disastrous consequences for the development of real 
analysis. That this is not the case, is due to the fact that in most cases it 
suffices to make decisions which can be made from sufficiently good 
approximations. Thus, instead of using x = 0 V x # 0, we can often use 
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the fact that for reals x, y, z, if x <y, then (z < y) V (x <z) can be 
decided for all z: just choose sufficiently close rational approximations to x, 
y, and z. 

In the weak counterexample of 3.5 A(n) may be any predicate for which 
our state of knowledge is represented by (1) of 3.5. 

We may reformulate 3.5 in terms of (characteristic) functions: let a € 
N — WN and let (r*), be a fundamental sequence determining x* € R, 
defined by 


7 27" if Wk <n(ak = 0), 

r. = 

7 2-* ifeak #OAk <nAWk' <k(ak’ =0); 

thus x“ as defined in 3.5 corresponds to x% if a is the characteristic 
function of A. We have x* = 0 © Vn(an = 0), and for a suitable choice of 
a x*=0 V x* #0 is unknown; but as we saw in 3.3 we cannot show 
(Vn(an = 0) V ~=Vn(an = 0)). On the other hand, it is possible to obtain 
proper refutations of the statement “PEM is generally true”. For the 
universal validity of PEM implies in particular 


V-PEM = Va[Wn(an = 0) V =Vn(an = 0)], 
or equivalently 
Va[Wn(an # 0) V =Wn(an # 0)], 


and as we shall see in chapter 4, both in constructive recursive mathematics 
and in traditional intuitionism certain principles hold which exclude a 
classical interpretation of the logical operators, and in particular imply 
—V-PEM. 

In CRM —V-PEM is a consequence of the assumption that all operations 
have to be algorithmic; in intuitionism —V-PEM is the result of the 
admission of choice sequences as legitimate mathematical objects. 

Thus the reasons for accepting =V-PEM are quite different in these two 
cases: in the first case .V-PEM is the result of a severe restriction on the 
range of possible operations, in the second case it is the result of enlarging | 
the domain of functions in N — N from predeterminate functions given by 
a law to the much wider class of functions determined by successive (more 
or less free) choices. 


3.7. Splitting of notions. Since the BHK-interpretation causes us to reject 
certain classically valid logical principles such as PEM, a splitting of 
notions results: classically equivalent definitions are not always equivalent 
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on the basis of constructive logic, as shown by the example of “finite” and 
“finitely indexed” in the preceding subsection. Constructively, one has also 
to distinguish between “finitely indexed set” and “subset of a finite set” 
(E1.3.2). 

Yet another example is the distinction between “non-empty set” and 
“inhabited set”: X is non-empty iff —W4x(x © X), and inhabited iff 
Ax(x € X). 

A more delicate example is the distinction between inequality and 
apartness between reals: x # y simply means that x and y are not equal, 
but x # y (“x is apart from y”) means that there is a positive distance 
between x and y: 


x#y = 3re Q(x -—y| >r>0). 


Intuitionistically these notions are distinct, but not in CRM (cf. 4.5.4). 
One might expect that the splitting of notions leads to an enormous 
proliferation of results in the various parts of constructive mathematics 
when compared with their classical counterparts. In practice, usually only 
very few constructive versions of a classical notion are worth developing, 
since other variants do not lead to a mathematically satisfactory theory. 


4. A brief history of constructivism 


The conscious practice of constructive mathematics is a relatively young 
phenomenon. Before the introduction of highly abstract notions, going 
hand in hand with abstract proofs, i.e. proofs that went beyond well-trusted 
tangible procedures, there was no need for a separate domain of the 
constructive. Of course, with hindsight, we can locate non-constructive 
proofs in the older mathematics but it would be a historical injustice to 
project our mathematical culture on our forefathers. 

To mention one example that from our point of view personifies non- 
constructive reasoning, proof by contradiction was universally accepted and 
employed. There is one particular domain where proof by contradiction is 
almost a trademark: geometry. From Euclid’s Elements to Hilbert’s 
Grundlagen proof by contradiction is required to set the deductive machin- 
ery working. It seems plausible that partly on the authority of Aristotle’s 
logic, partly on idealized and imperfect knowledge of the mathematical 
universe, the principle of the excluded middle in particular for identity, but 
more generally for arbitrary notions was taken for granted. 


Sect. 4] A brief history of constructivism 17 


In this chapter we will sketch some of the main trends in constructive 
mathematics. We will outline the basic ideas and the underlying philoso- 
phies, historic details of specific mathematical or logical notions or develop- 
ments are to be found im the final section (“notes”) of each chapter. 

In our exposition we have as a rule adopted the terminology that prevails 
today, the reader should be aware however that in the older literature terms 
like neo-intuitionist (for intuitionist), empiricist (for semi-intuitionist), etc. 
occur. 


4.1. Kronecker and the arithmetization of mathematics. A first clearcut 
manifest of constructive mathematics was issued by Kronecker (1823-1891), 
best known for his work in number theory and algebra. 

Kronecker’s constructivism is most easily recognized in his views on the 
status of certain mathematical objects. In his essay “Uber den Zahlbegriff” 
(1887) he outlined the project of “arithmetizing’” Algebra and Analysis; 
that is, to found these disciplines on the most fundamental notion of 
number. He explicitly, with a reference to Gauss, restricted this arithmetiza- 
tion to analysis and algebra; geometry and mechanics were left an indepen- 
dent existence, as not being purely determined by our mind, but referring to 
a certain reality outside our mind. 

In a number of lectures Kronecker developed a unified theory of num- 
bers, posthumously published by Hensel (Kronecker 1901), in which num- 
bers, congruences, polynomials and rational functions are treated on an 
equal footing. In the spirit of this tradition he obtains the integers from the 
natural numbers by considering N[x] mod (x + 1). 

Similarly rational numbers are introduced by considering congruences 
(Kronecker 1887, p. 261). 

The introduction of algebraic numbers is also based on modulus arith- 
metic, a procedure that is now universally adopted. One consequence of 
Kronecker’s arithmetization project was that he considered a definition (in 
number theory and algebra) acceptable only if it could be checked in a 
finite number of steps whether a given number falls under it or not. This 
viewpoint naturally led to his criticism of “pure” existence proofs. He 
stated that an existence proof for a number could only then be considered 
to be totally exact if it also contained a method to find the number whose 
existence was proven, (Kronecker 1901, p. vi). In Kronecker’s published 
works there is a minimum of philosophizing; apart from a few remarks on 
the nature of his programme he consistently sticks to actual mathematical 
practice. Some of his remarks belong to mathematical folklore, and they 
capture fairly well his basic outlook. Best known is his widely reported 
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statement that, “the Lord made the natural numbers (ganze Zahlen), 
everything else is the work of men”, and the following statement reiterates 
the same idea: “I consider mathematics only as an abstraction of the 
arithmetical reality”. 

Kronecker’s programme was taken up by Jules Molk. In a long paper 
(1885) he makes a few remarks along the line of Kronecker. On the topic of 
definitions he says that “definitions should be algebraic and not just logical. 
It is not sufficient to say: something is or it is not”. He points out that a 
logical definition, like “an irreducible function is by definition not reduci- 
ble” is not much help. In algebra a definition of irreducible should allow 
one to test in a finite number of steps that a function does not factorize in 
nontrivial factors. Only such a test will give meaning to the words reducible 
and irreducible. 

It must be pointed out that Molk does not contest the truth of statements 
like “a bounded set of numbers has a least upper bound”, he views this 
statement as a logical truth that does not belong to algebra. In the light of 
Kronecker’s remarks on the nature of mathematical objects and proofs it 
seems likely that Kronecker would reject the above statement as meaning- 
less. 


4.2. The French semi-intuitionists. The next stage in the evolution of con- 
structive mathematics was the discussion about Zermelo’s proof of the 
well-ordering theorem. Zermelo’s use of the axiom of choice in its full 
strength raised a mixed variety of objections. In particular a group of 
French mathematicians expressed criticism based on more or less clearly 
articulated constructive views. We will not go into the details of the 
axiom-of-choice controversy, but instead concentrate on some of the con- 
structivistic issues. The main representatives of constructive tendencies in 
this controversy were Baire, Borel, Lebesgue, Lusin and Poincaré. They did 
not offer a definite philosophical doctrine, but rather a mixture of views on 
mathematics with constructive tendencies. 


Poincaré (1854—1913). Poincaré’s views are well-known, he forcefully at- 
tacked set theory (Cantorism) and “logistics” (i.e. mathematical logic). 
Strictly speaking one cannot count him as a constructivist; his direct 
influence on later developments in constructive mathematics is mostly 
restricted to his views on intuition closely tied up with his criticism of logic. 
He was sceptical of both Aristotelian logic ~ “a syllogism cannot teach us 
anything essentially new” (1902, chapter I)-and the newer variants of 
Russell, Frege and Peano. 
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Poincaré argues that in mathematics one needs more than logic, one 
needs intuition. He mentions three kinds of intuition (1905, chapter I): the 
appeal to the senses and the imagination, generalization by induction, and 
finally the intuition of pure number. The latter one guarantees the exactness 
of mathematics. Mathematics has its specific methods of proof, and the 
method par excellence is the principle of induction, which is according to 
Poincaré a synthetic judgement a priori (“Les derniers efforts des 
logisticiens”, in 1908). 

In a number of ways Poincaré is a forerunner of the later constructivists: 
his recognition of the principle of mathematical induction as an unprovable 
fact, his criticism of the “creative role” of logic, his rejection of the 
Cantorian actual infinite and his embracement of intuition in mathematics. 

One should not, however, expect too much consistency in Poincare’s 
philosophical writings, (cf. Mooij 1966); in his essay “Les mathématiques et 
la logique” (in 1908) he defended, for example, the rather platonistic view 
that “Existence can mean only one thing: freedom from contradiction”. 

The paradoxes around the turn of the century were, in his view, caused 
by impredicative reasoning, vicious circles. In a number of publications he 
defended a predicative version of mathematics. Discussing Richard’s para- 
dox, he wrote “E is the set of all numbers one can define with finitely 
many words, without introducing the set E itself. Without this (restriction) 
the definition of E will contain a vicious circle; one cannot define E through 
the set E itself” (“Les derniers efforts des logisticiens”, in 1908). 

This is the familiar syntactic formulation of the predicative restriction. In 
“La logique de linfini” (in 1913) he provided a mathematical criterion: the 
definition of a set should be such that adding more elements to the universe 
under consideration does change the set. 


Emile Borel (1871-1956). Emile Borel was the most pronounced spokesman 
of the semi-intuitionists, he was closer to Kronecker’s position on a number 
of issues than Poincaré. He maintained, for example, that only effectively 
defined (i.e. by finitely many words) objects exist in the domain of science 
(1914, p. 171, 175), consistency does not suffice for existence. However, 
compared to Kronecker’s “ verifiability in finitely many steps” this “finite 
definability” is clearly a more liberal notion. It allows one, for example, to 
consider the functions f with f(x) = 0 for rational x and f(x) =1 for 
irrational x, as well-defined and existing. 

Lebesgue shared Borel’s views on this point. From Borel’s point of view 
individual real numbers had to be given by a finite definition, so as a set 
they could never get beyond the countable. Realizing how awkward a 
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countable continuum would be, Borel considered the continuum as inde- 
pendently given by intuition: “we accept that the collection c of real 
numbers between 0 and 1 is given, without investigating how they could 
effectively be given” (Borel 1914, p. 16); he spoke of the geometric con- 
tinuum. The same view had already been defended by Hélder in 1892 (cf. 
1924, p. 194) and Brouwer independently elaborated it in his Ph.D. Thesis 
(1907). 

This concept of geometric continuum yields a “whole” that precedes the 
individual elements, and cannot be thought of as“‘the set of its reals” (Lusin 
1930). For the practice of mathematics Borel nevertheless wished to have 
available a sufficiently large part of the continuum that could be considered 
as “made up of individual real numbers’. For this purpose he introduced 
the practical continuum (1914, p. 166), consisting of the finitely definable 
numbers. 

The discussion about Zermelo’s proof of the well-ordering theorem had 
two aspects that are relevant from the constructive viewpoint: the well- 
ordering of the continuum and the axiom of choice. The first aspect gave 
rise to reflections on the nature and structure of the continuum, the second 
one gave rise to reflections on the notion of “infinite succession of choices”. 
The topic of legitimacy of sequences of choices was taken up by Borel (cf. 
1914, p. 168). We will return to it in the context of intuitionism. 

Borel’s philosophical position may be inferred from the following quota- 
tion (1914, p. 173): 

“J do not understand what the abstract possibility of an act, which is 
impossible for the human mind can mean. For me this is a purely 
metaphysical abstraction without any scientific reality”. 

The foundational practice of the semi-intuitionists was mostly carried out 
in the margin of their main mathematical researches. One cannot say that a 
systematic philosophical (or foundational) basis for their constructive con- 
victions had been put forward. In general the constructivistic activities of 
the semi-intuitionists remained “a gentleman’s pastime”; their views have, 
however, played a role in the subsequent development of constructive 
mathematics. 


4.3. Brouwer and intuitionism. In the early years of the century Brouwer 
(1881-1966) had formed his views on mathematics, both pure and applied. 
In 1907 he published his dissertation “On the foundations of mathematics”. 
Since the dissertation was written in Dutch (an English translation ap- 
peared only in his “Collected Works” (1975)) the impact was marginal. 
Nonetheless it is worthwhile to discuss it since it contains much that is 
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basic for understanding his later and riper views. Leaving aside the purely 
mathematical topics we find three main themes: the continuum, mathe- 
matics and experience (applications), mathematics and logic. 

Brouwer recognized in his thesis—as far as we can see, independently 
from Borel — the continuum as a primitive notion given directly by intui- 
tion. 

Brouwer rejected the reduction of the continuum by means of any of the 
approximation methods of irrationals by rationals, since he considers these 
approximations as a process that unfolds in stages and hence at each stage 
remains denumerable (observe the analogy with Borel’s continuum, but 
note that Brouwer was probably not aware of Borel’s views until the 
international congress of mathematicians in 1908 in Rome). 

Brouwer’s view of mathematics is basic for the understanding of the 
development of intuitionism, his views on existence, mathematical objects, 
logic, language are all consequences of this characterization of mathe- 
matics: 

Mathematics is a free creation of the mind, it allows the construction of 
mathematical systems on the basis of intuition. Mathematical objects, 
hence, are mental constructs. 

Some consequences drawn by Brouwer are: 

(1) From the Kantian a priori only the “intuition of time” remains. 

(2) Mathematics is independent of logic; logic is an application of 
mathematics. 

(3) Mathematics cannot be founded upon the axiomatic method. 

In particular Brouwer rejected in his thesis Hilbert’s (early) formalism 
(1905) and Cantorian set theory. Brouwer’s criticism of certain parts of 
traditional mathematical practice has led to the unfortunate mistaken view 
that Brouwer’s “revolution” was mainly a negative one (cf. Hilbert 1922). 
Actually Brouwer introduced a number of innovations that lifted his 
intuitionistic'mathematics well above the level of securing some safe corners 
in the building of mathematics. 

The concept that is most closely tied to Brouwer’s name is that of “choice 
sequence”. Sequences of numbers determined by choice had already oc- 
curred in the literature before Brouwer’s first exposition in 1918. Borel had 
discussed them in relation with Zermelo’s paper on the well-ordering 
theorem (1914, pp. 168-169), and he had pointed out that in order to 
recover the “complete arithmetical concept of the continuum” one needs 
countable sequences of choices (1914, p. 161). Before him Du Bois Reymond 
had already considered real numbers with decimals determined by throwing 
a die (1882). However, with Borel and Du Bois Reymond these sequences 
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remained curiosities. It was Brouwer who discovered how to exploit the 
choice feature in practical mathematics. After rejecting choice sequences as 
non-intuitionistic (1912), he came to accept them (1914), and in 1918 he 
published his first basic intuitionistic paper in which the notion of choice 
sequence was a corner stone. In that paper his continuity principle was also 
enunciated and immediately used to show the non-denumerability of N‘, 
for a historical discussion cf. Troelstra (1982A). 

Further refiection on the nature of choice sequences led Brouwer to a 
proof of the “uniform continuity theorem” (i.e. all functions from [0,1] to 
R are uniformly continuous). Analysis of the proof shows that it is based on 
yet another intuitionist principle, the principle of Bar Induction (cf. section 
4.8, and 4.10.5). 

The negative aspects of Brouwer’s foundational activities are to be found 
in his criticism of the mathematical practice of his day. From his point of 
view, which characterized mathematics as an inner, mental constructional 
activity, language and logic were secondary phenomena. Language was a 
highly fallible tool for communication, not to be trusted as a source of 
insight, and logic was a product derived from mathematics. For Brouwer’s 
views on language cf. (1907, 1929, 1949). 

As far as logic was concerned, Brouwer had rather conservative views. 
Sometimes he understood by logic the traditional Aristotelian theory of 
syllogisms, sometimes the logic of Peano and Russell. 

Mathematical reasoning, according to Brouwer, consists of constructing 
mathematical structures, and the frequent appearance of chains of applica- 
tions of logical rules (syllogisms) is a practice that is only justified by 
mathematical constructions accompanying each application of a rule. The 
above applies to intuitive logical arguments. Theoretical logic, according to 
Brouwer, is an application of mathematics, and an empirical science; it will 
never teach us anything concerning the organization of the human intellect. 

We highlight a few of Brouwer’s views. 

On quantification (1907, cf. 1975, p. 76): 

“the mistake which so many people, thinking that they could reason 
logically about other objects than mathematical structures built by them- 
selves, and overlooked, that wheresoever logic uses the word all or every, 
this word, in order to make sense, tacitly involves the restriction: insofar as 
belonging to a mathematical structure which is supposed to be constructed 
beforehand”. 

On consistency (in particular Hilbert’s early formalism, 1905). Brouwer’s 
comments are roughly as follows (cf. 1975, p. 78): Not recognizing any 
intuitive mathematics, the consistency of (e.g.) number theory has to be 
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proved in a linguistic framework by means of axioms (among which 

certainly the axiom of complete induction), next one has to prove the 

consistency of this system. However, 

(1) this does not constitute any progress over the previous stage, 

(2) consistency does not imply the existence of an accompanying 
mathematical structure, 

(3) even if the mathematical system of reasoning exists, this does not 
entail that it is alive, i.e. that it accompanies a sequence of thoughts, and 
even if the latter is the case this sequence of thoughts need not be a 
mathematical development, so it need not be convincing. 

On set theory: Cantor’s justification of the existence of sets given by 
comprehension (or even separation) is incorrect. For a number of important 
notions in set theory no existence grounds exist, e.g. the second number 
class and NF do not exist. (1975, p. 81, 87). 

On the levels of meta-theory: In the logico-linguistic treatment of mathe- 
matics one can discern certain stages (1975, p. 174): 

(1) The construction of pure mathematical (mental) systems. 

(2) The linguistic description of mathematics. 

(3) The mathematical study of (2), including the logical structure. 

(4) Abstraction from the meaning of the expressions of (3), and of the 
logical operations, thus creating a second-order mathematical system 
(this is the system of the logicists). 

(5) The language of (4). 

(6) The mathematical study of (5) (a step made by Hilbert, but not by 
Peano and Russell). 

(7) Abstraction from the meaning of the expressions of (6), thus creating a 
mathematical system of the third order. 

(8) The language of (7) (this is the last stage considered by Hilbert). Etc. 

The well-known, and somewhat notorious issue of the principle of the 
excluded middle (PEM) has sidetracked many of Brouwer’s contemporaries 
into believing that intuitionism was primarily seeking a revision of the 
logical apparatus of mathematics. The title of Brouwer’s first full scale 
exposition (1918), may very well have fostered this misunderstanding. 

In the paper “The unreliability of the logical principles” (1908), Brouwer 
drew the inevitable consequence from his views on mathematics and refuted 
PEM using an undecided property of the decimals of 7. 

Brouwer himself never elaborated in writing the meaning of the logical 
constants, that step was eventually made by Heyting and Kolmogorov, but 
in his writings a “proofs as constructions”-concept is clearly implicit (cf. 
1907, chapter III; 1908). 
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In the late twenties Brouwer started to look for a way to exploit the 
peculiar properties inherent in the creative subject, the results appeared in 
print only after the second world war, (1948, 1949). The part of intuitionism 
based on the creative subject has remained rather marginal. 

Brouwer’s programme of reconstructing mathematics along intuitionistic 
lines found few adherents; a considerable part of the programme was 
carried out by his principal student A. Heyting, whose best-known contri- 
bution to intuitionism was the formalization of intuitionistic logic (and 
arithmetic) and the formulation of a natural semantics for it. This semantics 
is the BHK-interpretation of section 3; it is sometimes called the proof 
interpretation (Heyting 1934 and 1.5.3 below). 


4.4. Hermann Weyl (1885-1955) and predicativity. The tradition of Poin- 
caré and Russell, which laid the blame of inconsistencies and paradoxes on 
impredicative definitions, was taken up in 1918 by Weyl, who published a 
monograph, “Das Kontinuum” (1918). Like his predecessors, Weyl was out 
to avoid vicious circles; he rebuilt analysis using only a very restricted kind 
of sets and functions. Indeed, he only allowed arithmetically definable sets, 
i.e. sets defined by comprehension over formulas without bound set vari- 
ables. Weyl’s reconstruction was quite successful, a large portion of tradi- 
tional mathematics was recovered. 

Weyl’s philosophical position in this monograph (and in foundational 
matters in general) was that of Husserl’s Phenomenology. 

However, after learning of Brouwer’s intuitionistic foundation of 
mathematics he gave up his own programme and embraced intuitionism. In 
“Uber die neue Grundlagen Krise der Mathematik” (1921) he unequiv- 
ocally sided with Brouwer in the latter’s revolution, but later in life he 
dissociated himself from intuitionism and foundations in general. Lorenzen 
1955 can be viewed as a continuation of Weyl’s original programme. 

Both Weyl and Lorenzen were concerned with the actual revision of 
mathematical practice. Later work in predicativism was mainly of a meta- 
mathematical nature (cf. Feferman 1964, 1978, Kreisel 1960). So far predi- 
cative mathematics has remained a curiosity for logicians rather than a 
realistic alternative for the working mathematician. Recently, however, 
there has been a renewed interest in proof theoretically weak systems with 
strong expressive power, (Takeuti 1978, Friedman 1980, Friedman et al. 
1983), e.g. as in Friedman’s “reverse mathematics”. In fact, a surprisingly 
large body of mathematics can be developed in quite weak systems, much 
weaker than so-called predicative analysis. In many cases even conservative 
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extensions of first-order arithmetic allow the development of a good deal of 
traditional mathematics, in particular analysis. 


4.5. Finitism. Brouwer’s intuitionism contained unmistakable abstract no- 
tions; in particular sets (species), choice sequences, spreads are abstract 
objects in mathematics proper. The use of abstract notions has been 
recognized and criticized by a number of mathematicians, e.g. Kronecker, 
Skolem and Goodstein. They proposed a very narrow version of construc- 
tive mathematics, one in which only concrete combinatorial operations on 
(strictly) finite mathematical objects are allowed. The prototype of “finite 
mathematical object” is “natural number” and a good example of a 
“combinatorial operation” is provided by a “table of multiplication”. 

Skolem, already in 1919, systematically considered what kind of combi- 
natorial operation on the natural numbers can be introduced (1923). 

He developed a sizable part of primitive recursive arithmetic, i.e. he 
studied primitive recursive functions avant la lettre in a quantifier-free 
system. In fact, he worked in an informal mathematics, but he initiated a 
long tradition in the theory of primitive recursive arithmetic, cf. Goodstein 
(1957). 

Skolem explicitly adopted Kronecker’s foundational viewpoint: “a 
mathematical definition is a genuine definition if and only if it leads to the 
goal by means of a finite number of trials” (1923). As far as the founda- 
tions of mathematics are concerned, Kronecker is a forerunner of finitism. 

The restricted combinatorial view also turned up in Hilbert’s programme. 
Hilbert, in his improved formalist philosophy considered the finitistic, 
combinatorial mathematics as the truly meaningful part of mathematics. 
Mathematics dealing with the actual infinite, e.g. full arithmetic, was to be 
reduced to the finitary mathematics (arithmetic). This is the basic idea of 
Hilbert’s programme: ideal statements are to be granted a meaning in terms 
of finitary statements. Finitistic mathematics, of course, is completely 
constructive and hence a part of intuitionistic mathematics. Some early 
writers on the foundations of mathematics have confused finitistic mathe- 
matics and intuitionistic mathematics, e.g. Herbrand (1931). 

After Gédel’s demonstration that classical arithmetic could be reduced to 
intuitionistic arithmetic via his “negative translation”, (1933), it was clear 
that even intuitionistic arithmetic went beyond finitism. 


6 


4.6. Constructive recursive mathematics and Markov’s School. Another con- 
cept of constructivity, based on the theory of algorithms or recursive 
functions, has presented itself in the thirties. This development, however, 
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had a totally different philosophical motivation. At first primitive recursive 
functions and recursive functions were introduced as tools for meta- 
mathematical research. Gédel used (what we now call) primitive recursive 
functions in his proof of the incompleteness theorem (1931), and as pointed 
out above, Skolem had already in 1919 developed the basis of primitive 
recursive arithmetic (without introducing the class of primitive recursive 
functions explicitly) (Skolem 1923), Elaborating a suggestion of Herbrand, 
Gédel introduced the wider class of recursive functions in (1934). 

Along completely different lines Church introduced the A-calculus (1932), 
a theory that only used A-abstraction and application. In this theory the 
natural numbers could be represented and Kieene showed that the recursive 
functions were A-definable. 

Curry, in the meantime, had developed combinatory logic (1930), follow- 
ing the early work of Schdnfinckel. 

The equivalence of A-calculus and combinatory logic was established by 
Rosser, and the equivalence of the Herbrand—Gédel recursive functions 
with the A-definable ones was proved by Church (1936) and by Kleene 
(1936). 

An interesting conceptual characterization of algorithms was undertaken 
by Turing, who used abstract machines, now known as Turing machines 
(Turing 1937). Turing analyzed the intuitive concept of “mechanically 
computable by a human operator” and put forward a convincing argument 
that “computable” coincides with “computable by a Turing machine”. 
Church had also stated in 1936 a characterization of ““computable”, but in 
terms of recursiveness, he equated “computable” and “recursive” (1936). 
This identification of “effectively computable” and “recursive” has become 
known as Church’s Thesis. 

The theory of (partial) recursive functions has become a cornerstone of 
metamathematics. The reader will find more information on the historical 
details in the relevant chapters. 

Mathematics on the basis of Church’s Thesis was propagated by Markov, 
who founded a school of constructive mathematics in Moscow. Markov laid 
down the following principles for constructive mathematics: 

(1) the objects of constructive mathematics are words in various al- 
phabets. 

(2) abstraction through potential realizability is allowed, but not abstrac- 
tion through the actual infinite. 

(1) must be understood as including e.g. recursive functions, as given (or 
coded) by a set of equations or an index. An example of a potentially 
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realizable process is the addition of natural numbers, it does not involve the 
actually infinite set of natural numbers, but an algorithm that for any given 
n and m produces (a representation of) n + m. 

Markov formulated his constructive mathematics in terms of normal 
algorithms, also known as Markov algorithms (cf. Markov 1954A, Mendelson 
1964). We shall not use Markov’s formalism, but use the more familiar 
recursive functions. 

A consequence of Markov’s view is that number theoretic functions are 
recursive; that is to say the potentially realizable version of this “infinite” 
object must allow a representation by an index. In this sense Markov’s 
constructive mathematics can be viewed as a theory based on Church’s 
Thesis. 

On the one hand Markov is more strict than the intuitionists, e.g. he 
rejects choice objects, on the other hand he is more liberal. This is 
illustrated by his argument for justification of his principle of constructive 
choices (Markov 1962), nowadays called Markov’s Principle. (Markov 
formulated his principle in lectures in Leningrad in 1952/1953, it appeared 
in print in Markov 1954). 

In familiar terms, the principle states that if it is not the case that a 
Turing machine does not halt on a given input, then it does halt. Markov’s 
argument, in terms of Turing machines, is simply as follows; if it is 
impossible that the Turing machine will compute forever, then there is a 
clear algorithm for obtaining the output: just continue the process until it 
halts. Markov added: 

“TI know that intuitionists reject this method of argument, since they do 
not consider it ‘intuitively clear’. In connection with this I consider it 
necessary to make the following remarks. Firstly, my intuition finds this 
sufficiently clear. On the other hand, I can in no way agree to taking 
‘intuitively clear’ as a criterion for truth in mathematics, for this criterion 
would mean the complete triumph of subjectivism and would lead to a 
break with the understanding of science as a form of social activity. If I 
defend this means of argument here, it is not because I find it without error 
according to my intuition, but rather, firstly because I see no reasonable 
basis for rejecting it, and secondly, because arguments of this type make it 
possible to construct a constructive mathematics that is well able to serve 
contemporary natural science” (Markov 1962). 

As Markov indicates, his principle is not accepted by the constructivists 
at large; the principle has, however, found its place in the metamathematics 
of constructive mathematics (see e.g. sections 3.5, 4.5). 
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4.7. Bishop and constructive mathematics for the working mathematicians. 
The landscape of constructive mathematics suddenly changed in 1967 when 
Bishop (1928—1983) published his “Foundations of constructive analysis”. 
“This book, he said, had a threefold purpose: first, to present the construc- 
tive point of view; second, to show that the constructive program can 
succeed; third, to lay a foundation for further work. These immediate ends 
tend to an ultimate goal—to hasten the inevitable day when constructive 
mathematics will be the accepted norm”. Bishop’s message was that the real 
business of constructive mathematics is mathematics; it came at a time 
when constructivism was mostly tending its foundations, and it was pre- 
sented by a first-rate mathematician. 

In contrast to Brouwerian intuitionism (even after Heyting’s presenta- 
tion), Bishop’s nrathematics used the language of the working mathemati- 
cian. He attacked mathematics much in the spirit of Kronecker, be it that 
he introduced abstract objects in a more direct way. 

A few examples may help to illustrate Bishop’s position. 

In his view the only way to show that an object exists is to give a finite 
routine for finding it. 

Sets are introduced by him as “the totality of all mathematical objects 
constructed in accord with certain requirements” and they come with a 
given equivalence relation called “equality”. 

The basic higher type objects are operators, i.e. rules which assign 
elements of a set A to elements of a set B. 

Such a rule f must afford an explicit, finite mechanical reduction of the 
procedure for constructing f(a) to the procedure for constructing a. A 
function then, is an operator that respects equality. 

Bishop does not specify the notion of “rule”, but throughout rules could 
be taken to be recursive; that is, Bishop’s mathematics is compatible with 
Church’s thesis. 

The fundamental thesis of Bishop’s mathematics is that “all mathematics 
should have numerical meaning”, which is closer to Kronecker (or even 
Hilbert) than to Brouwer. Bishop foregoes the abstract objects of classical 
mathematics as well as those of the intuitionists. His mathematics rests on 
neutral ground, without ontological or idealistic objects and principles. 
Thus he cannot, like Brouwer, show that all real functions are continuous, 
but as a pragmatist he restricts his attention to continuous functions. 

The reader can see for himself how actual mathematics is done construc- 
tively in Bishop’s instructive book. Bishop had a number of followers, and 
at present a number of mathematicians are constructivizing mathematics 
along the same lines as Bishop. 
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The work of Bishop also raised a number of metamathematical questions, 
the main question being: which principles suffice to found Bishop’s 
mathematics on. There are to date two main approaches to the problem: a 
set theoretical one, proposed by Myhill and Friedman, and an “operations 
and classes” one, proposed by Feferman. The reader may consult chapter 9 
and Beeson'(1985). Bishop’s constructive mathematics was intended as the 
neutral part of constructive mathematics and as compatible with traditional 
classical mathematics. Metamathematical analysis has born this out, one 
can (in principle) extend his mathematics both in the direction of Brouwer’s 
intuitionism and in the direction of recursive mathematics @ Ja Markov. 


4.8. Esenin-Vol'pin’s ultra-finitism. In the opinion of a number of mathe- 
maticians Brouwer’s criticism of traditional mathematics stopped short of 
the potential infinity of mathematics — in particular, of the natural number 
sequence. Borel, for instance, declared that “the very large finite offers the 
same difficulties as the infinite”, Borel (1947), and van Dantzig (1956) posed 
the question “Is 10!” a finite number?” in the tradition of Mannoury 
(1909). Wittgenstein also shared the view that mathematics has no business 
with an idealized infinite set of natural numbers. The first to develop the 
strict finitist view beyond a collection of philosophical reflections was A.S. 
Esenin-Vol’pin. From 1959 onward he undertook the founding of mathe- 
matics on a strictly finite basis. He challenged the view that there is, up to 
isomorphism, a unique sequence of natural numbers. Taking a realistic view 
of natural numbers, there does not seem to be a good reason to assume that 
the successor operation can indefinitely be iterated. So there may be 
universes of natural numbers that do not have the same length. As a 
consequence of the limitations as to the natural numbers that can actually 
be reached, the universe(s) of natural numbers may not be closed under the 
common operations, such as multiplication or exponentiation. The strictly 
finitist view also has its consequences for logic; the derivations of A and 
A — B may still be within reach, but in order to apply modus ponens one 
might have to exceed the available natural numbers necessary for the length 
of the derivation of B. In a sense Esenin-Vol’pin’s program is even more 
restrictive than Hilbert’s finitism, therefore “ultra-finitism” is a more fitting 
name than the original “ultra-intuitionism”. 

The program indicated in Esenin-Vol’pin (1961, 1970, 1981) so far has 
not progressed beyond the initial stage. There are, however, certain re- 
searches into the possibility of introducing feasible numbers, e.g. by Parikh 
(1971), which tie up with considerations concerning complexity. Geiser 
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(1974) has applied the technique of non-standard models to ultra-finitist 
arithmetic. 


5. Notes 


5.1. Books on constructive mathematics and its metamathematics. Heyting 
(1956) is still a very readable introduction to intuitionistic mathematics. 
Brouwer’s own approach is illustrated in his Cambridge lectures (1981). 

Dummett (1977) is an introduction to intuitionism with only a little 
mathematics, but it contains a good deal of logic and metamathematics as 
well as philosophical discussion. : 

For a development of constructive mathematics in the style of E. Bishop 
see Bishop and Bridges (1985) and Bridges (1979). 

Aberth (1980) gives an introduction to recursive analysis; Kushner (1973) 
(recently translated) is an excellent introduction to constructive recursive 
mathematics in the spirit of A.A. Markov. 

Of the monographs on the metamathematics of constructive mathematics 
we mention Beeson (1985), Gabbay (1981) (intuitionistic predicate logic), 
Kleene and Vesley (1965) (intuitionistic analysis), Lambek and Scott (1986) 
(for connections between intuitionistic logic and category theory), and 
Troelstra (1973). Further information can be obtained from the notes at the 
end of each chapter, and the bibliography by Miiller (1987). 


5.2. Hypothetical reasoning. is also essential for the development of con- 
structive mathematics. 

Certain passages in Brouwer’s thesis might erroneously create the impres- 
sion that he did not want to consider hypothetical constructions, in particu- 
lar constructions which afterwards turn out to be impossible (1907, reprint, 
p. 22). In any case there is no doubt that in his later papers Brouwer 
accepted hypothetical reasoning, e.g. in Brouwer (1923) where he shows 
that the triple absurdity of a statement is equivalent to its absurdity (the 
logical law A < ———4- in our notation). 

The obvious intuitive objection is: how can we have a clear mental 
picture of a construction which afterwards turns out to be impossible? This 
objection has been raised at least twice in the history of intuitionism (cf. 
Troelstra 1983A). 

Freudenthal (1937) observes “We first ask ourselves how we should 
intuitionistically understand a proposition [“Satz” in the German original]. 
The most obvious explanation is to regard a proposition as the assertion of 
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a fact, verified by a proof [...]. This explanation must be treated with 
caution, since, if a proposition is only the assertion of a fact, then the 
separation between proof and proposition is not justified any longer”. Also, 
“If we recall that the correct formulation of a proposition is its proof, we 
immediately see that the construction of a proof of b from a proof of a, or 
the reduction of the solution of problem 6 to the solution of problem a is 
possible only if a has been shown to be correct — and thus implication is 
useless”. 

Heyting (1937) in his reply remarks “The following simple example 
shows that the problem a> b in certain cases can be solved without a 
solution for problem a being known. For a I take the problem ‘find in the 
sequence of decimals of 7 a sequence 0123456789’, for b the problem ‘find 
in the sequence of decimals of 7 a sequence 012345678’. Clearly b can be 
reduced to a by a very simple construction”. 

A late echo of this discussion is found in the attempt by Griss (see e.g. 
1946, 1955) to develop intuitionistic mathematics without negation. 


5.3. The BHK-interpretation. may be regarded as implicit in Brouwer’s 
writings (for example in his treatment of negation in Brouwer 1923). 
Heyting (1930C, 1931, 1934) made the interpretation explicit. In Heyting 
(1930C, 1931) a proposition is explained as “the expectation to find a 
certain condition fulfilled”; asserting a proposition is then the same as 
stating that a construction has been found which shows the condition to be 
fulfilled. 

Kolmogorov (1932) gave an interpretation of intuitionistic propositional 
logic as a calculus of problems. Thus for example, A — B represents the 
problem to solve problem B when a solution to problem A is known, or 
equivalently, to reduce the solution of B to the solution of A. Heyting 
(1934) contains the clause for implication as presented here; he also 
extended Kolmogorov’s interpretation to predicate logic. Later Heyting 
came to regard Kolmogorov’s interpretation as essentially the same as his 
own (1958), 

Kreisel (1962, 1965; for an exposition see also Troelstra 1969) proposed a 
version of the BHK-interpretation where the construction establishing an 
implication A — B consists of a pair (po, P;) where py is the construction 
transforming any proof of A into a proof of B, and p, a construction 
verifying this fact. In addition 2, := “p proves A”, and m( Po, p;, A, B) = 
“p, shows that, for all g, g proves A = po(q) proves B” are assumed to be 
decidable for arbitrary p, po, p,;, A, B. The clause for universal quantifica- 
tion is similarly modified. 
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Kreisel proposed this version in the hope of obtaining interesting 
new models for intuitionistic systems, but this hope was not fulfilled. 
N. Goodman developed a rather complicated theory of constructions based 
on Kreisel’s modified interpretation of the clauses for — and V (see e.g. 
1970), where the constructions were divided into levels, but it turned out 
that for his principal technical result — the conservativeness of the axiom of 
countable choice over intuitionistic arithmetic — these features of Goodman’s 
theory were not needed. 

Sundholm (1983) argued at length that the insight that “for all gq: 
q proves A => p,(q) proves B” should not itself be regarded as a mathe- 
matical object. Weinstein (1983) on the other hand regards the decidability 
of the proof predicate and the extra clauses in Kreisel’s variant of the 
BHK- interpretation as essential for a foundational justification of intuition- 
istic logic. 

As shown by Note A in Troelstra (1981), it is possible to give trivial 
models with decidable proof predicates 7, and decidable 7. See also the 
discussion in chapter 16. 


Exercises 


13.1. Show that the first two groups of schemas listed in section 3.4 are valid on the 
BHK-interpretation. 


1.3.2. A subfinite set is a subset of a finite set. Give weak counterexamples to the assertions: 
(a) each subfinite set is finitely indexed and (b) each finitely indexed set is subfinite (cf. 3.6). 


1.3.3. Give a weak counterexample to Kénig’s lemma in 2.4. Hint, Cf. the use of the predicates 
A and B in 5.2.12. 


1.3.4. Let D be an inhabited domain and let Ro, Ri, Rz,... be relations over D. Let A be 
any sentence constructed from primitive statements 1, R Ad ) and logical operations 
A,V,27,V,2. 

Associate with each A a set of “proofs” p[A] by induction on the number of logical 
operations in A such that: 4 
(i) p[R,(d)] is inhabited iff the finite sequence d of elements of D satisfies the relation R,, 

and otherwise empty; p[1] = 9; 

Gi) pLB A BY} = p[B] x p[B}; 
(iii) p[B Vv B’] = {(0, x): x © p[B]} U (1, x): x © p[BT}; 
(iv) p[B — B’} = p[B] > p[B’], the set of mappings from p[B] to p[B’; 
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(vy) piVxB(x)] = (f:¥x © D(f(x) © p[B(4@))}: 

(vi) p[AxB(x)] = {((d, x): x © p[B(d)]}. 

Show, using classical reasoning, that A < 3x(x © p[A)) for arbitrary assignments p of sets of 
proofs to R (d. ) satisfying the clauses (i)—(vi). This exercise shows that the BHK-interpretation 
in itself has no “explanatory power”: the possibility of recognizing a classically valid logical 
schema as being constructively unacceptable depends entirely on our interpretation of “con- 
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struction”, “function”, “operation”. 


CHAPTER 1 


INTRODUCTION 


In this chapter we first present a brief characterization of the various 
forms of constructivism which shall play a role in the book later on. After 
the review of constructivist trends, we illustrate the idea of constructivity by 
means of some simple and quite elementary examples in section 2; most of 
the points discussed will receive a more thorough treatment in later chapters. 

Section 3 introduces the Brouwer—Heyting—Kolmogorov interpretation 
of the logical operators, and the method of weak counterexamples. 

Section 4 gives a brief historical survey. 


1. Constructivism 


1.1. Time and again, over the last hundred years certain mathematicians 
have defended an approach to mathematics which might be called “con- 
structive” in the broad sense used in this book, in more or less explicit 
opposition to certain forms of mathematical reasoning used by the majority 
of their colleagues. Some of these critics of the mathematics of their (our) 
time not only criticized contemporary mathematical practice, but actually 
endeavoured to show how mathematics could be rebuilt on constructivist 
principles. 

There are, however, considerable differences in outlook between the 
various representatives of constructivism; constructivism in the broad sense 
is by no means homogeneous, and even the views expressed by different 
representatives of one “school”, or by a single mathematician at different 
times are not always homogeneous. Our descriptions below present a 
simplified picture of this complex reality. We shall attempt a brief char- 
acterization of the principal constructivist “trends” or “schools” playing a 
role in this book. 

In our discussions we shall use the adjective “classical” for logical and 
mathematical reasoning based on the usual two-valued logic, in which every 
meaningful statement is assumed to be either true or false. 


1 
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1.2. Finitism. The principal tenets of finitism are: 

(a) only concretely (finitely) representable structures are objects of 
mathematics; operations on such structures are to be combinatorial in 
nature and hence, of course, effective. 

(b) abstract notions such as (arbitrary) set, operation, construction etc. 
have no place in finitist mathematics. 

Kronecker (1823-1891), who with some justification may be regarded as 
“the first constructivist”, may perhaps more specifically be regarded as a 
precursor of the finitist approach. Finitism is characteristically represented 
by Skolem in his paper (1923) and in two books by Goodstein (1957, 1961). 

Finitism also plays a role in Hilbert’s programme: for methods regarded 
by all mathematicians as incontrovertible, to be used in a proof of freedom 
of contradiction of (the bulk of) existing mathematics, Hilbert wanted to 
draw on finitist mathematics; this of course does not mean that Hilbert is to 
be regarded as a finitist. 

A formal system which is regarded as codifying a typical part of finitist 
mathematics is PRA, the system of primitive recursive arithmetic, to be 
discussed briefly in section 3.2. 


1.3. Predicativism. This may be regarded as “constructivism with respect to 
definitions”. In predicativism: 

(a) Definitions of mathematical objects should be predicative, that is to 
say it is not permissible to define an object d by referring to a collection D 
of which the object d is to be an element; in particular, quantification over 
D in defining d is not permitted (avoidance of a “vicious circle” in 
definitions). 

(b) The demand for predicative definitions is regarded as compatible with 
traditional logic, that is to say mathematical statements are assumed to be 
true or false — regardless of human knowledge (this in contrast to intuition- 
ism, described in the next subsection). 

As a rule the set of natural numbers N is regarded as unproblematic from 
a predicativist point of view. This means that one accepts quantification 
over N and thus arithmetically defined predicates as predicatively meaning- 
ful. 

Having thus grasped the idea of an arithmetically defined set or predicate 
(sets of level 1), we may quantify over such sets in constructing predica- 
tively defined sets of level 2, etcetera. 

On the other hand, defining the least upper bound of a set X of reals 
(defined as Dedekind (left) cuts) as the intersection of all left cuts which are 
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upper bounds for X is not permissible: the least upper bound itself occurs 
among the collection of upper bounds. 

The first to show that parts of analysis could be developed predicatively 
was Weyl in his monograph “Das Kontinuum” (1918). More recently 
Lorenzen in his books (1955, 1965) has given a predicative development of 
analysis. Lorenzen (1955), however, does not accept classical logic outright, 
but justifies its use relative to constructive logic. 

Poincaré and the French “empiricists” such as Borel may be regarded as 
precursors of predicativism, but they did not develop parts of mathematics 
predicatively in a systematic way (cf. also 4.2). 

Predicativism as such does not play a role in this book, but the demands 
of predicativism may or may not be combined with the demands of 
intuitionism and Bishop constructivism and thus the issue of predicativity 
occasionally crops up in other constructivist trends as well. 


1.4. Bishop’s Constructive Mathematics (“BCM”). This approach is typi- 
cally represented by Bishop’s book “Foundations of constructive analysis” 
(1967). In the work of this school there are few discussions of philosophical 
principles, the emphasis is entirely on the practice of constructive mathe- 
matics, instead of on its philosophy. 

(a) The key phrase is that “mathematical statements should have numeri- 
cal meaning”. This means, in particular, that existential statements must, in 
principle, be capable of being made explicit (and also, in asserting A V B, 
the choice between A or B must be possible in principle): one can only 
show that an object exists by giving a finite routine for finding it. 

(b) In BCM it is not assumed that all mathematical objects are to be 
given in the form of an algorithm (in the mathematically precise sense of 
recursive function theory). Choice sequences (cf. 1.6 below) are not ad- 
mitted as legitimate objects. Bishop does not hesitate to accept abstract 
concepts such as “rule” or “operation”. 

In BCM no definite position is taken with respect to the admissibility of 
impredicative definitions; in practice, generalized inductive definitions are 
admitted (Bishop 1967, p. 68); for a discussion of this type of definition see 
section 4.8. 


1.5. Constructive Recursive Mathematics (“CRM”). This is a form of con- 
structivism developed by Markov (1903-1979) from ca. 1950 onwards. In 
the sense we understand it here it is typically represented by the writings of 
Markov, Shanin and their students in the period 1950-1967. 
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A feature CRM has in common with finitism is that mathematical objects 
should be finitely representable. More specifically: 

(a) The objects of mathematics are algorithms, understood in a mathe- 
matically precise sense: algorithms can be presented as “words” in some 
finite alphabet of symbols. 

(b) Limitations due to finite memory capacity are disregarded, the length 
of symbol strings is unbounded (though always finite). 

(c) Logically compound statements not involving 4, V are understood in 
a direct way, but existential statements and disjunctions always have to be 
made explicit. 

(d) If it is impossible that an algorithmic computation does not terminate, 
we may assume that it does terminate (“‘Markov’s principle”). 


1.6. Intuitionism. Intuitionism will be understood here as the constructive 
approach to mathematics in the spirit of Brouwer (1881-1966) and Heyting 
(1898-1980). The philosophical basis of this approach is already present in 
Brouwer’s thesis (1907) but with respect to the mathematical consequences 
Brouwer (1918) is a more appropriate starting point. The basic tenets may 
be summarized as follows. 

(a) Mathematics deals with mental constructions, which are immediately 
grasped by the mind; mathematics does not consist in the formal manipula- 
tion of symbols, and the use of mathematical language is a secondary 
phenomenon, induced by our limitations (when compared with an ideal 
mathematician with unlimited memory and perfect recall), and the wish to 
communicate our mathematical constructions to others. 

(b) It does not make sense to think of truth or falsity of a mathematical 
statement independently of our knowledge concerning the statement. A 
statement is true if we have proof of it, and false if we can show that the 
assumption that there is a proof for the statement leads to a contradiction. 
For an arbitrary statement we can therefore not assert that it is either true 
or false. 

(c) Mathematics is a free creation: it is not a matter of mentally 
reconstructing, or grasping the truth about mathematical objects existing 
independently of us (this is in contrast to e.g. the French empiricists; cf. 
4.2). 

It follows from (b) that it is necessary to adopt a different interpretation 
of statements of the form “there exists an x such that A(x) holds” and “A 
or B holds”. In particular, “A or not A” does not generally hold on the 
intuitionistic reading of “or” and “not”; we return to this in section 3. 
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In agreement with, but not necessarily following from (c), intuitionism 
permits consideration of unfinishable processes: the ideal mathematician 
may construct longer and longer initial segments a(Q),...,a(m) of an 
infinite sequence of natural numbers a where a is not a priori determined 
by some fixed process of producing the values, so the construction of a@ is 
never finished: «a is an example of a choice sequence. We return to this topic 
in section 4.6. 

Unfinishable objects such as choice sequences clearly do not fit into the 
framework of CRM, the viewpoint described in the preceding subsection. 

On the other hand, “Markov’s principle”, mentioned under (d) in 1.5, is 
not considered as intuitionistically valid. In fact, it is difficult to decide the 
validity of Markov’s principle by informal analysis alone. We shall return to 
Markov’s principle in section 4.5. 


1.7. In this book we shall have frequent occasion to refer to intuitionism, 
CRM and BCM in particular. However, the present book is not written 
from the viewpoint of any particular school or trend mentioned above, 
inasmuch we shall not accept any of these views as normative for mathe- 
matics. 

We do present constructive mathematics (in the wide sense) as a legiti- 
mate part of mathematics, containing material which is mathematically 
interesting, regardless of any philosophical bias. 

Constructive mathematics, and especially its logic and its metamathe- 
matics are of considerable interest for the philosophy of mathematics. 
There are some indications that intuitionistic logic can be useful in other 
parts of mathematics too, such as topos theory (see e.g. Johnstone 1977, 
1982), or theoretical computer science (see for example Constable et al. 
1986). 


2. Constructivity 


2.1. In this section we shall attempt to illustrate the idea of constructivity 
by means of some quite elementary examples. Most of the points discussed 
will receive a more thorough treatment in later chapters. 

Which objects can be said to exist as (mental) constructions? Natural 
numbers are usually regarded as unproblematic from a constructive point 
of view; they correspond to very simple mental constructions: start thinking 
of an abstract unit, think of another unit distinct from the first one and 
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consider the combination (“think them together’). The indefinite repetition 
of this process generates the collection N of natural numbers. 

It should be pointed out that already here an element of idealization 
enters. We regard 5, 1000 and 10!” as objects of “the same sort” though 
our mental picture in each of these cases is different: we can grasp “five” 
immediately as a collection of units, while on the other hand 10!” can only 
be handled via the notion of exponentiation; 1000 represents an inter- 
mediate case. Visualizing 10!” asa sequence of units is out of the question. 
Exponentiation as an always performable operation on the natural numbers 
involves a more abstract idea than is given with the generation of N. 

Certain mathematicians, such as A.S. Esenin-Vol’pin have tried to dis- 
tinguish between “reasonably” small and “unreasonably” large numbers, 
and between different “natural number structures” according to the princi- 
ples needed to generate them. Permitting exponentiation may result in a 
bigger number sequence with a more complex structure, than in the case 
where exponentiation is not considered, although addition and multiplica- 
tion are accepted. This approach might be described as ultra-finitism. The 
name “ultra-intuitionism” has sometimes been used, but seems less ap- 
propriate. There are considerable obstacles to be overcome for a coherent 
and systematic development of ultra-finitism, and in our opinion no satis- 
factory development exists at present. In this volume we shall therefore 
refrain from any further discussion of ultra-finitism except for some re- 
marks in 4.8, 

Thus all the constructivist schools described in section 1 contain elements 
of idealization, that is to say their description of constructive mathematical 
principles contains “theoretical” elements. In particular, we regard 
Brouwer’s intuitionism and Markov’s constructivism as “theories about 
mathematics”. It is also for this reason that one cannot identify intuition- 
ism with a psychologistic approach to mathematics. 


2.2. A non-constructive definition. Let A be any mathematical statement 
which at present has been neither proved nor refuted (e.g., A =“there are 
infinitely many twin primes”). Ordinarily one permits in mathematics 
descriptions (definitions) of natural numbers such as the following 


= t if A holds, 
P 2 otherwise. 


Constructively this is unacceptable as the description of a natural number 
since, as long as the truth of A has not been decided, we cannot identify p 
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with either 1 or 2; in other words we do not know how to obtain p by 
“thinking some abstract units together”, since we do not know whether to 
stop at one unit or to go on till we have two distinct units. 


2.3. Once having accepted the natural numbers, there is also no objection 
to accepting pairs of natural numbers, pairs of pairs etc. as constructive 
objects; and this permits us to define Z (the integers) and Q (the rationals) 
in the usual way as pairs of natural numbers and pairs of integers 
respectively, each modulo suitable equivalence relations. 

Infinite sequences of natural numbers, integers or rationals may be 
constructively given by some process enabling us to determine the nth term 
(value for n) for each natural number n; in particular, one may think of 
sequences given by a /aw (or recipe) for determining all its values (terms). 

Thus a real number can be specified by a fundamental sequence of 
rationals together with a Cauchy modulus, that is to say we specify a 
sequence (7,,),, & N > Q and a sequence a € N > N (the modulus) such 
that for all k € N and all m, m’ > ak 


Tn = Tn | s 2F, 


Two such sequences (7,,),, (5,), are Said to be equivalent (equal as real 
numbers) if for all k © N there is an n € N such that for all m 


ia onal pay 


Much of the elementary theory of real numbers can then be developed in 
exactly the same way as usual; we shall do this in chapter 5. 

We now give an often quoted example of a non-constructive proof for the 
following 


PROPOSITION. There exist two irrational real-numbers a, b such that a? is 
rational. 


Proor. (/2)”” is either rational, and then we can take a = b = /2, or (/2)"” 
is irrational, and then we can take a = (/2)"*, b= 2. O 


This proof does not enable us to construct a as a real number, that is to 
say we cannot compute a with any desired degree of accuracy, as required 
by our description of what it means to be given a real number. In other 
words, we do not know how to find an arbitrarily close rational approxima- 
tion. Note however, that our objections to the proof depend on reading 
“there exist” as “one can construct”. 
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A constructive proof of the proposition above is possible, e.g. by an 
appeal to Gelfond’s theorem: if a € {0,1}, a algebraic, b irrational alge- 
braic, then a? is irrational (even transcendental). Cf. e.g. Gelfond (1960, p. 
106, theorem 2). 

As a second example of a non-constructive argument, we consider the 
classical proof of Kénig’s lemma. 


2.4, PROPOSITION (KGnig’s lemma). Let T be an infinite, finitely branching 
tree, then T has an infinite branch. 


Proof. We “construct” the infinite branch a € N — T as follows. For a0 
we take the root of 7. Suppose an to have been constructed such that an 
has infinitely many successors. Among the finite set of immediate successors 
to,-.-,¢, Of an there is at least one ¢; such that ¢; has infinitely many 
successors; take a(n + 1) = ¢,. Clearly we can continue this process inde- 
finitely and an infinite branch is generated. O 


Given some ordering of the nodes, we can make the description of @ in 
the proof more definite by taking for a(n + 1) the first ¢,; having infinitely 
many successors. Nevertheless, this “construction” of a is not constructive, 
since we do not know, in general, how to decide whether a finite sequence 
in T has infinitely many successors or not; so the proof above does not give 
us a recipe for producing an for each n. 

The preceding examples of non-constructive proofs show that the natural 
constructive reading of “A or B” and “4xA(x)” as “we can decide between 
A and B” and “we can construct an x such that A(x)” respectively does 
not agree with all the laws of classical logic. In particular, “A or 4A” does 
not hold if we read “or” as above: we have no reason to assume that we can 
always decide, for any assertion A, whether A is true or refutable. 

In actual mathematical practice, one is often interested in constructiviz- 
ing “locally”, e.g. making existential statements of a specific theorem 
explicit by estimating bounds etc. In contrast, we shall mostly investigate 
globally constructive theories, where our logic can consistently be interpre- 
ted as permitting explicit realizations of all existential statements, and all 
theories should be read with a strong constructive meaning of J and Vv. 


3. Weak counterexamples 


In 1908 Brouwer introduced for the first time “weak counterexamples”, for 
the purpose of showing that certain classically acceptable statements are 
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constructively unacceptable (Brouwer 1908). Too much emphasis on these 
examples has sometimes created the false impression that refuting claims of 
classical mathematics is the principal aim of intuitionism. 

Nevertheless, the weak counterexamples are pedagogically useful and 
may serve to motivate e.g. Kripke semantics, which we shall introduce in 
the next chapter. Before we can explain these counterexamples we must 
explain in a more systematic way the constructive interpretation of the 
logical operators. 

In discussing pure logic we shall treat “constructive” and “intuitionistic” 
as synonymous. 


3.1. Logical operations; the BHK-interpretation. In the actual building of 
constructive mathematics we do not need logic; nevertheless we find it 
convenient to use logical symbolism. We shall explain the use of the logical 
operations in a constructive context by the following stipulations (going 
back to Heyting 1934), which tell us what forms proofs of logically 
compound statements take in terms of the proofs of the constituents. 


(H1) A proof of A A B is given by presenting a proof of A and a proof 
of B. 

(H2) A proof of A V B is given by presenting either a proof of A or a 
proof of B (plus the stipulation that we want to regard the proof 
presented as evidence for A V B). 

(H3) A proof of A > B is a construction which permits us to transform 
any proof of A into a proof of B. 

(H4) Absurdity 1 (contradiction) has no proof; a proof of AA is a 
construction which transforms any hypothetical proof of A into a 
proof of a contradiction. 

(H5) A proof of VxA(x) is a construction anich transforms a proof of 
d © D (D the intended range of the variable x) into a proof of A(d). 

(H6) A proof of 3xA(x) is given by providing d € D, and a proof of A(d). 


This explanation is quite informal and rests itself on our understanding of 
the notion of construction and, implicitly, the notion of mapping; it is not 
hard to show that, on a very “classical” interpretation of construction and 
mapping, H1-6 justify the principles of two-valued (classical) logic (cf. 
E1.3.4). In clause H4 the notion of a contradiction is to be regarded as a 
primitive (unexplained) notion. 

If, in the clauses H5, H6 the domain D is “simple” enough, it may 
happen that any d in D so to speak represents its own proof of belonging 
to D (d is presented to us as an object of D). N is an example: a natural 
number is given to us as such, we do not need a separate proof of this fact. 
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For such simple domains, the reference to “a proof of d © D” in H5 and 
H6 may be dropped. The clauses H5 and H6 then read 


(H5’) A proof of VxA(x) is a construction which transforms any d € D 
into a proof of A(d). 

(H6’) A proof of 3xA(x) is given by presenting a d € D and a proof of 
A(d). 


In the sequel we shall refer to the clauses H1-6 above as the 
“Brouwer~Heyting—Kolmogorov interpretation” (BHK-interpretation, see 
5.3). 


3.2. Even if the explanations H1-6 leave a lot of questions open, they 
suffice to show that certain logical principles should be generally acceptable 
from a constructive point of view, while some other principles from 
classical logic are not acceptable. 

As a positive example, consider the statement A — (B — A), for arbi- 
trary mathematical statements A, B. There is a very general proof, obtained 
as follows. Suppose a proves A. Then Ab.a, the constant function which 
assigns a to each argument, transforms a hypothetical proof of B always 
into a proof of A; so Ab.a proves B > A. Therefore if a proves A, then 
\b.a proves B — A; and so the mapping Aa.[Ab.a], which assigns to a the 
constant mapping Ab.a, transforms a proof of A into a proof of B > 4; 
hence Aa.[Ab.a] is a proof of A > (B > A). 

Another consequence is that .L — A is generally provable: since there is 
no proof of 1, Aa.a (or any other mapping) may count as a proof of 
4 — A, since it has to be applied to an empty domain. The principle 
1—A (“ex falso sequitur quodlibet”) has sometimes been rejected as 
non-constructive (Johannson 1937); Heyting (1956, 7.1.3) regards it as an 
extra stipulation fixing the meaning and use of 1, >. 

The rejection of 1 — A means that one regards 1 as a proposition not 
known to be provable (but which may turn out to be provable, if our 
mathematical world is inconsistent). This position leads to minimal logic; 
the difference with intuitionistic logic based on H1-6 is slight (cf. 10.5.2). 

On the negative side, consider the Principle of the Excluded Middle 
(PEM, or “‘tertium non datur’’) 


PEM AV WA, 


which is generally valid in classical logic. Constructively, accepting PEM as 
a general principle means that we have a universal method for obtaining, 
for any proposition A, either a proof of A or a proof of —A, i.e. a method 
for obtaining a contradiction from a hypothetical proof of A. 
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But if such a universal method were available, we could also decide a 
statement A the truth of which has not been decided yet (e.g. the example 
of the infinitely many twin primes in subsection 2.2), which is not the case. 
Thus we cannot accept PEM as a universally valid principle on the 
BHK-interpretation. 

The preceding argument is typically a “weak counterexample”: PEM is 
not refuted, that is to say, we have not shown how to derive an actual 
contradiction from the assumption that PEM is valid. Instead, we have only 
shown PEM to be unacceptable as a BHK-valid principle of reasoning, 
since accepting it means that we ought to have certain knowledge (i.e. a 
decision as to the truth of A) which in fact we do not possess. One might 
also say that the “weak counterexample” makes the validity of PEM for the 
BHK-interpretation highly implausible. 


3.3. In fact, we cannot hope to refute any individual instance of PEM, that 
is to say we cannot find a mathematical statement A such that 4(A V — A). 
This is impossible, since —4=(A V 4A) holds universally: it is an intuition- 
istic logical law. This may be seen on the basis of the BHK-interpretation as 
follows. Suppose 


c establishes (4 V —A). (1) 
Hence, 

if d establishes A V —A, c(d) proves 1. 
Clearly, there are operations e+ ey, f+ f, such that 


e establishes A = e, establishes A V “A, 


f establishes ,A = f, establishes 4 V 4A. 


Therefore if (1) holds, g: e > c(eg) (ie. g = Ae.c(eg)) is a construction 
for A>1L=—A and h:fc(f,) (or hi=Afi.c(f,)) for nA PL. 
Therefore h(g) is a construction for 1 , so the mapping c ~ h(g) estab- 
lishes —=(A V —A). 


3.4. In the chapters dealing with examples from constructive mathematics 
we shall often appeal to a number of simple logical laws valid on the 
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BHK-interpretation; in particular the following are frequently encountered: 
A> A; 
3A @ —W4; 
(A V B) @ AAA -B; 
(A A B) @(A>-=B)e(B>—A); 
(A > B) © (44 ~ 4B) © (A > 4B) 0 (54 V B); 
(A > B) @ =(A4 V B); 
ifA—> B, then 4“B>-A, 
and with quantifiers 
43xA(x) @ VxiA(x); 
a-VxA(x) > Wx=A4(x); 
=3xA(x) @ AVxA(x). 


We leave it as an exercise to verify that these laws are valid on the 
BHK-interpretation. 

On the other hand, besides PEM, the following are examples of principles 
which are valid in classical logic but not on the BHK-interpretation: 


a4 > A; 

AAV A; 

(A > B)v (B> A); 

(a4 AaB) >A VB: 

LCAVGE) ARE 

VxA(x) ~ =AVxA(x); 

374.xA(x) > 3x-4A4(x); 

(A > 3xB(x)) ~ dx(A > B(x)) (x € FV(A)). 


We shall encounter methods to demonstrate their non-validity in later 
chapters (sections 2.5, 4.3, 4.6). 

In the standard formalism, IQC, for intuitionistic predicate logic, dis- 
cussed in the following chapter, terms are supposed to be always defined, 
that is to say terms always denote something, just as in the usual for- 
malisms for classical predicate logic. However, in intuitionistic mathematics 
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possibly undefined terms or expressions which are well-defined only for 
certain values of the parameters, crop up naturally (e.g. x~} for reals x); 
while the classical way out, namely to guarantee that such expressions 
always denote something by means of arbitrary stipulations, is not available 
here (more about this in 5.3.11). Thus some care is needed in handling such 
expressions; if we use Er for “t is well-defined” (corresponding to t | in 
recursion theory), we have in particular 


WxA(x) A Et > A(t), , 
A(t) A Et > 3xA(x). 0) 


Replacing the usual principles VxA(x) — A(t) and A(t) — 3xA(x) by the 
more cautious (1) corresponds on the formal level to the E*-logic of 2.2.3. 
In handling possibly undefined expressions in our treatment of parts of 
constructive mathematics in chapters 5—8 we shall assume (1). 


3.5. Undecidability of equality for the reals. A more sophisticated weak 
counterexample is the following. Let A(n) be a predicate of natural 
numbers, such that A(n) is decidable, but the truth of VnA(n) is unknown, 
i.e. 


Vn(A(n) V =A(n)), 


1 
2WnA(n) V ~WnA(n). 0) 
At the time of writing this section we might take “Fermat’s last theorem” 
as an example of such A, putting A(n) = Vm,m,m,k(m, + m,+m,+k 
<n—>(m, + 1)k*3 + (m, + 1)*3 + (m, + 1)**3), where m,, mz, m3, 
k, n range over N. 
Suppose now that we define a real number x 
sequence of rationals (7,4), defined as follows: 


4 via a fundamental 


n 


1 [27" ifvk <nA(k), 
res 
2-* if A(k) Ak <n A Vk! < kA(k’). 


It is easy to see that this is indeed a fundamental sequence, since 
Vn > m(|r4 — r4] < 27™); also 


x4=0 6 WnA(n). 


The direction from right to left is obvious; conversely, if x4 = 0, then 


14 Introduction [Ch. 1 


ra, < 27” necessarily follows for all n, hence A(n) for all n. Therefore 


x4=O0Vxt#0 


is equivalent to VnA(n) V =VnA(n). Thus we have a weak counterexample 
to the decidability of equality between reals; this example, in its turn, can 
serve for the construction of weak counterexamples to many other state- 
ments, as we shall see below. 


3.6. Let us call a set finite iff it is in 1-1 correspondence with an initial 
segment of N, and consider for example the following subset of the reals: 


X= {0, x4}. 


X is the image of the finite set {0,1} (under the mapping ¢ given by 
pO = 0, pl = x“), or in other words, X is finitely indexed (namely by 
{0, 1}). However, we cannot show X to be finite, since this would require us 
to decide whether it has one or two distinct elements, and this in turn 
would require us to decide WnA(n) V —WnA(n). 

Another consequence of the weak counterexample to x = 0 V x # 0 is, 
that the usual examples of discontinuous functions from R to R fail to yield 
total functions. For example, the function f specified by 


f(x) =1 ifx#0 
f(x) =0 ifx=0 


is classically everywhere defined, but not so constructively. For if we 
suppose f to be constructively defined everywhere on R, we ought to be 
able to construct arbitrarily close approximations to f(x) for any argument 
x; but this means in particular that we should be able to decide 


f(x4) <1 Vv f(x4) > 0, 


hence 4x4 #0 V x4 #0. This. is equivalent to x4 =0 V x4 #0: for 
decidable A(n) one readily sees ~4A(n) © A(n), and since x4 =0 0 
VnA(n), a-WnA(n) > Wn—— A(n), we see that 4x4 # 0 © x4 = 0. Since 
x4 =0 Vv x4 # 0 is undecided, we also cannot assert that f is everywhere 
defined on R. 

At first sight, it seems as if the failure of decidability of equality between 
the reals must have disastrous consequences for the development of real 
analysis. That this is not the case, is due to the fact that in most cases it 
suffices to make decisions which can be made from sufficiently good 
approximations. Thus, instead of using x = 0 V x # 0, we can often use 


Sect. 3] Weak counterexamples 15 


the fact that for reals x, y, z, if x <y, then (z < y) V (x <z) can be 
decided for all z: just choose sufficiently close rational approximations to x, 
y, and z. 

In the weak counterexample of 3.5 A(n) may be any predicate for which 
our state of knowledge is represented by (1) of 3.5. 

We may reformulate 3.5 in terms of (characteristic) functions: let a € 
N — WN and let (r*), be a fundamental sequence determining x* € R, 
defined by 


7 27" if Wk <n(ak = 0), 

r. = 

7 2-* ifeak #OAk <nAWk' <k(ak’ =0); 

thus x“ as defined in 3.5 corresponds to x% if a is the characteristic 
function of A. We have x* = 0 © Vn(an = 0), and for a suitable choice of 
a x*=0 V x* #0 is unknown; but as we saw in 3.3 we cannot show 
(Vn(an = 0) V ~=Vn(an = 0)). On the other hand, it is possible to obtain 
proper refutations of the statement “PEM is generally true”. For the 
universal validity of PEM implies in particular 


V-PEM = Va[Wn(an = 0) V =Vn(an = 0)], 
or equivalently 
Va[Wn(an # 0) V =Wn(an # 0)], 


and as we shall see in chapter 4, both in constructive recursive mathematics 
and in traditional intuitionism certain principles hold which exclude a 
classical interpretation of the logical operators, and in particular imply 
—V-PEM. 

In CRM —V-PEM is a consequence of the assumption that all operations 
have to be algorithmic; in intuitionism —V-PEM is the result of the 
admission of choice sequences as legitimate mathematical objects. 

Thus the reasons for accepting =V-PEM are quite different in these two 
cases: in the first case .V-PEM is the result of a severe restriction on the 
range of possible operations, in the second case it is the result of enlarging | 
the domain of functions in N — N from predeterminate functions given by 
a law to the much wider class of functions determined by successive (more 
or less free) choices. 


3.7. Splitting of notions. Since the BHK-interpretation causes us to reject 
certain classically valid logical principles such as PEM, a splitting of 
notions results: classically equivalent definitions are not always equivalent 
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on the basis of constructive logic, as shown by the example of “finite” and 
“finitely indexed” in the preceding subsection. Constructively, one has also 
to distinguish between “finitely indexed set” and “subset of a finite set” 
(E1.3.2). 

Yet another example is the distinction between “non-empty set” and 
“inhabited set”: X is non-empty iff —W4x(x © X), and inhabited iff 
Ax(x € X). 

A more delicate example is the distinction between inequality and 
apartness between reals: x # y simply means that x and y are not equal, 
but x # y (“x is apart from y”) means that there is a positive distance 
between x and y: 


x#y = 3re Q(x -—y| >r>0). 


Intuitionistically these notions are distinct, but not in CRM (cf. 4.5.4). 
One might expect that the splitting of notions leads to an enormous 
proliferation of results in the various parts of constructive mathematics 
when compared with their classical counterparts. In practice, usually only 
very few constructive versions of a classical notion are worth developing, 
since other variants do not lead to a mathematically satisfactory theory. 


4. A brief history of constructivism 


The conscious practice of constructive mathematics is a relatively young 
phenomenon. Before the introduction of highly abstract notions, going 
hand in hand with abstract proofs, i.e. proofs that went beyond well-trusted 
tangible procedures, there was no need for a separate domain of the 
constructive. Of course, with hindsight, we can locate non-constructive 
proofs in the older mathematics but it would be a historical injustice to 
project our mathematical culture on our forefathers. 

To mention one example that from our point of view personifies non- 
constructive reasoning, proof by contradiction was universally accepted and 
employed. There is one particular domain where proof by contradiction is 
almost a trademark: geometry. From Euclid’s Elements to Hilbert’s 
Grundlagen proof by contradiction is required to set the deductive machin- 
ery working. It seems plausible that partly on the authority of Aristotle’s 
logic, partly on idealized and imperfect knowledge of the mathematical 
universe, the principle of the excluded middle in particular for identity, but 
more generally for arbitrary notions was taken for granted. 


Sect. 4] A brief history of constructivism 17 


In this chapter we will sketch some of the main trends in constructive 
mathematics. We will outline the basic ideas and the underlying philoso- 
phies, historic details of specific mathematical or logical notions or develop- 
ments are to be found im the final section (“notes”) of each chapter. 

In our exposition we have as a rule adopted the terminology that prevails 
today, the reader should be aware however that in the older literature terms 
like neo-intuitionist (for intuitionist), empiricist (for semi-intuitionist), etc. 
occur. 


4.1. Kronecker and the arithmetization of mathematics. A first clearcut 
manifest of constructive mathematics was issued by Kronecker (1823-1891), 
best known for his work in number theory and algebra. 

Kronecker’s constructivism is most easily recognized in his views on the 
status of certain mathematical objects. In his essay “Uber den Zahlbegriff” 
(1887) he outlined the project of “arithmetizing’” Algebra and Analysis; 
that is, to found these disciplines on the most fundamental notion of 
number. He explicitly, with a reference to Gauss, restricted this arithmetiza- 
tion to analysis and algebra; geometry and mechanics were left an indepen- 
dent existence, as not being purely determined by our mind, but referring to 
a certain reality outside our mind. 

In a number of lectures Kronecker developed a unified theory of num- 
bers, posthumously published by Hensel (Kronecker 1901), in which num- 
bers, congruences, polynomials and rational functions are treated on an 
equal footing. In the spirit of this tradition he obtains the integers from the 
natural numbers by considering N[x] mod (x + 1). 

Similarly rational numbers are introduced by considering congruences 
(Kronecker 1887, p. 261). 

The introduction of algebraic numbers is also based on modulus arith- 
metic, a procedure that is now universally adopted. One consequence of 
Kronecker’s arithmetization project was that he considered a definition (in 
number theory and algebra) acceptable only if it could be checked in a 
finite number of steps whether a given number falls under it or not. This 
viewpoint naturally led to his criticism of “pure” existence proofs. He 
stated that an existence proof for a number could only then be considered 
to be totally exact if it also contained a method to find the number whose 
existence was proven, (Kronecker 1901, p. vi). In Kronecker’s published 
works there is a minimum of philosophizing; apart from a few remarks on 
the nature of his programme he consistently sticks to actual mathematical 
practice. Some of his remarks belong to mathematical folklore, and they 
capture fairly well his basic outlook. Best known is his widely reported 
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statement that, “the Lord made the natural numbers (ganze Zahlen), 
everything else is the work of men”, and the following statement reiterates 
the same idea: “I consider mathematics only as an abstraction of the 
arithmetical reality”. 

Kronecker’s programme was taken up by Jules Molk. In a long paper 
(1885) he makes a few remarks along the line of Kronecker. On the topic of 
definitions he says that “definitions should be algebraic and not just logical. 
It is not sufficient to say: something is or it is not”. He points out that a 
logical definition, like “an irreducible function is by definition not reduci- 
ble” is not much help. In algebra a definition of irreducible should allow 
one to test in a finite number of steps that a function does not factorize in 
nontrivial factors. Only such a test will give meaning to the words reducible 
and irreducible. 

It must be pointed out that Molk does not contest the truth of statements 
like “a bounded set of numbers has a least upper bound”, he views this 
statement as a logical truth that does not belong to algebra. In the light of 
Kronecker’s remarks on the nature of mathematical objects and proofs it 
seems likely that Kronecker would reject the above statement as meaning- 
less. 


4.2. The French semi-intuitionists. The next stage in the evolution of con- 
structive mathematics was the discussion about Zermelo’s proof of the 
well-ordering theorem. Zermelo’s use of the axiom of choice in its full 
strength raised a mixed variety of objections. In particular a group of 
French mathematicians expressed criticism based on more or less clearly 
articulated constructive views. We will not go into the details of the 
axiom-of-choice controversy, but instead concentrate on some of the con- 
structivistic issues. The main representatives of constructive tendencies in 
this controversy were Baire, Borel, Lebesgue, Lusin and Poincaré. They did 
not offer a definite philosophical doctrine, but rather a mixture of views on 
mathematics with constructive tendencies. 


Poincaré (1854—1913). Poincaré’s views are well-known, he forcefully at- 
tacked set theory (Cantorism) and “logistics” (i.e. mathematical logic). 
Strictly speaking one cannot count him as a constructivist; his direct 
influence on later developments in constructive mathematics is mostly 
restricted to his views on intuition closely tied up with his criticism of logic. 
He was sceptical of both Aristotelian logic ~ “a syllogism cannot teach us 
anything essentially new” (1902, chapter I)-and the newer variants of 
Russell, Frege and Peano. 
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Poincaré argues that in mathematics one needs more than logic, one 
needs intuition. He mentions three kinds of intuition (1905, chapter I): the 
appeal to the senses and the imagination, generalization by induction, and 
finally the intuition of pure number. The latter one guarantees the exactness 
of mathematics. Mathematics has its specific methods of proof, and the 
method par excellence is the principle of induction, which is according to 
Poincaré a synthetic judgement a priori (“Les derniers efforts des 
logisticiens”, in 1908). 

In a number of ways Poincaré is a forerunner of the later constructivists: 
his recognition of the principle of mathematical induction as an unprovable 
fact, his criticism of the “creative role” of logic, his rejection of the 
Cantorian actual infinite and his embracement of intuition in mathematics. 

One should not, however, expect too much consistency in Poincare’s 
philosophical writings, (cf. Mooij 1966); in his essay “Les mathématiques et 
la logique” (in 1908) he defended, for example, the rather platonistic view 
that “Existence can mean only one thing: freedom from contradiction”. 

The paradoxes around the turn of the century were, in his view, caused 
by impredicative reasoning, vicious circles. In a number of publications he 
defended a predicative version of mathematics. Discussing Richard’s para- 
dox, he wrote “E is the set of all numbers one can define with finitely 
many words, without introducing the set E itself. Without this (restriction) 
the definition of E will contain a vicious circle; one cannot define E through 
the set E itself” (“Les derniers efforts des logisticiens”, in 1908). 

This is the familiar syntactic formulation of the predicative restriction. In 
“La logique de linfini” (in 1913) he provided a mathematical criterion: the 
definition of a set should be such that adding more elements to the universe 
under consideration does change the set. 


Emile Borel (1871-1956). Emile Borel was the most pronounced spokesman 
of the semi-intuitionists, he was closer to Kronecker’s position on a number 
of issues than Poincaré. He maintained, for example, that only effectively 
defined (i.e. by finitely many words) objects exist in the domain of science 
(1914, p. 171, 175), consistency does not suffice for existence. However, 
compared to Kronecker’s “ verifiability in finitely many steps” this “finite 
definability” is clearly a more liberal notion. It allows one, for example, to 
consider the functions f with f(x) = 0 for rational x and f(x) =1 for 
irrational x, as well-defined and existing. 

Lebesgue shared Borel’s views on this point. From Borel’s point of view 
individual real numbers had to be given by a finite definition, so as a set 
they could never get beyond the countable. Realizing how awkward a 
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countable continuum would be, Borel considered the continuum as inde- 
pendently given by intuition: “we accept that the collection c of real 
numbers between 0 and 1 is given, without investigating how they could 
effectively be given” (Borel 1914, p. 16); he spoke of the geometric con- 
tinuum. The same view had already been defended by Hélder in 1892 (cf. 
1924, p. 194) and Brouwer independently elaborated it in his Ph.D. Thesis 
(1907). 

This concept of geometric continuum yields a “whole” that precedes the 
individual elements, and cannot be thought of as“‘the set of its reals” (Lusin 
1930). For the practice of mathematics Borel nevertheless wished to have 
available a sufficiently large part of the continuum that could be considered 
as “made up of individual real numbers’. For this purpose he introduced 
the practical continuum (1914, p. 166), consisting of the finitely definable 
numbers. 

The discussion about Zermelo’s proof of the well-ordering theorem had 
two aspects that are relevant from the constructive viewpoint: the well- 
ordering of the continuum and the axiom of choice. The first aspect gave 
rise to reflections on the nature and structure of the continuum, the second 
one gave rise to reflections on the notion of “infinite succession of choices”. 
The topic of legitimacy of sequences of choices was taken up by Borel (cf. 
1914, p. 168). We will return to it in the context of intuitionism. 

Borel’s philosophical position may be inferred from the following quota- 
tion (1914, p. 173): 

“J do not understand what the abstract possibility of an act, which is 
impossible for the human mind can mean. For me this is a purely 
metaphysical abstraction without any scientific reality”. 

The foundational practice of the semi-intuitionists was mostly carried out 
in the margin of their main mathematical researches. One cannot say that a 
systematic philosophical (or foundational) basis for their constructive con- 
victions had been put forward. In general the constructivistic activities of 
the semi-intuitionists remained “a gentleman’s pastime”; their views have, 
however, played a role in the subsequent development of constructive 
mathematics. 


4.3. Brouwer and intuitionism. In the early years of the century Brouwer 
(1881-1966) had formed his views on mathematics, both pure and applied. 
In 1907 he published his dissertation “On the foundations of mathematics”. 
Since the dissertation was written in Dutch (an English translation ap- 
peared only in his “Collected Works” (1975)) the impact was marginal. 
Nonetheless it is worthwhile to discuss it since it contains much that is 
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basic for understanding his later and riper views. Leaving aside the purely 
mathematical topics we find three main themes: the continuum, mathe- 
matics and experience (applications), mathematics and logic. 

Brouwer recognized in his thesis—as far as we can see, independently 
from Borel — the continuum as a primitive notion given directly by intui- 
tion. 

Brouwer rejected the reduction of the continuum by means of any of the 
approximation methods of irrationals by rationals, since he considers these 
approximations as a process that unfolds in stages and hence at each stage 
remains denumerable (observe the analogy with Borel’s continuum, but 
note that Brouwer was probably not aware of Borel’s views until the 
international congress of mathematicians in 1908 in Rome). 

Brouwer’s view of mathematics is basic for the understanding of the 
development of intuitionism, his views on existence, mathematical objects, 
logic, language are all consequences of this characterization of mathe- 
matics: 

Mathematics is a free creation of the mind, it allows the construction of 
mathematical systems on the basis of intuition. Mathematical objects, 
hence, are mental constructs. 

Some consequences drawn by Brouwer are: 

(1) From the Kantian a priori only the “intuition of time” remains. 

(2) Mathematics is independent of logic; logic is an application of 
mathematics. 

(3) Mathematics cannot be founded upon the axiomatic method. 

In particular Brouwer rejected in his thesis Hilbert’s (early) formalism 
(1905) and Cantorian set theory. Brouwer’s criticism of certain parts of 
traditional mathematical practice has led to the unfortunate mistaken view 
that Brouwer’s “revolution” was mainly a negative one (cf. Hilbert 1922). 
Actually Brouwer introduced a number of innovations that lifted his 
intuitionistic'mathematics well above the level of securing some safe corners 
in the building of mathematics. 

The concept that is most closely tied to Brouwer’s name is that of “choice 
sequence”. Sequences of numbers determined by choice had already oc- 
curred in the literature before Brouwer’s first exposition in 1918. Borel had 
discussed them in relation with Zermelo’s paper on the well-ordering 
theorem (1914, pp. 168-169), and he had pointed out that in order to 
recover the “complete arithmetical concept of the continuum” one needs 
countable sequences of choices (1914, p. 161). Before him Du Bois Reymond 
had already considered real numbers with decimals determined by throwing 
a die (1882). However, with Borel and Du Bois Reymond these sequences 


22 Introduction [Ch. 1 


remained curiosities. It was Brouwer who discovered how to exploit the 
choice feature in practical mathematics. After rejecting choice sequences as 
non-intuitionistic (1912), he came to accept them (1914), and in 1918 he 
published his first basic intuitionistic paper in which the notion of choice 
sequence was a corner stone. In that paper his continuity principle was also 
enunciated and immediately used to show the non-denumerability of N‘, 
for a historical discussion cf. Troelstra (1982A). 

Further refiection on the nature of choice sequences led Brouwer to a 
proof of the “uniform continuity theorem” (i.e. all functions from [0,1] to 
R are uniformly continuous). Analysis of the proof shows that it is based on 
yet another intuitionist principle, the principle of Bar Induction (cf. section 
4.8, and 4.10.5). 

The negative aspects of Brouwer’s foundational activities are to be found 
in his criticism of the mathematical practice of his day. From his point of 
view, which characterized mathematics as an inner, mental constructional 
activity, language and logic were secondary phenomena. Language was a 
highly fallible tool for communication, not to be trusted as a source of 
insight, and logic was a product derived from mathematics. For Brouwer’s 
views on language cf. (1907, 1929, 1949). 

As far as logic was concerned, Brouwer had rather conservative views. 
Sometimes he understood by logic the traditional Aristotelian theory of 
syllogisms, sometimes the logic of Peano and Russell. 

Mathematical reasoning, according to Brouwer, consists of constructing 
mathematical structures, and the frequent appearance of chains of applica- 
tions of logical rules (syllogisms) is a practice that is only justified by 
mathematical constructions accompanying each application of a rule. The 
above applies to intuitive logical arguments. Theoretical logic, according to 
Brouwer, is an application of mathematics, and an empirical science; it will 
never teach us anything concerning the organization of the human intellect. 

We highlight a few of Brouwer’s views. 

On quantification (1907, cf. 1975, p. 76): 

“the mistake which so many people, thinking that they could reason 
logically about other objects than mathematical structures built by them- 
selves, and overlooked, that wheresoever logic uses the word all or every, 
this word, in order to make sense, tacitly involves the restriction: insofar as 
belonging to a mathematical structure which is supposed to be constructed 
beforehand”. 

On consistency (in particular Hilbert’s early formalism, 1905). Brouwer’s 
comments are roughly as follows (cf. 1975, p. 78): Not recognizing any 
intuitive mathematics, the consistency of (e.g.) number theory has to be 
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proved in a linguistic framework by means of axioms (among which 

certainly the axiom of complete induction), next one has to prove the 

consistency of this system. However, 

(1) this does not constitute any progress over the previous stage, 

(2) consistency does not imply the existence of an accompanying 
mathematical structure, 

(3) even if the mathematical system of reasoning exists, this does not 
entail that it is alive, i.e. that it accompanies a sequence of thoughts, and 
even if the latter is the case this sequence of thoughts need not be a 
mathematical development, so it need not be convincing. 

On set theory: Cantor’s justification of the existence of sets given by 
comprehension (or even separation) is incorrect. For a number of important 
notions in set theory no existence grounds exist, e.g. the second number 
class and NF do not exist. (1975, p. 81, 87). 

On the levels of meta-theory: In the logico-linguistic treatment of mathe- 
matics one can discern certain stages (1975, p. 174): 

(1) The construction of pure mathematical (mental) systems. 

(2) The linguistic description of mathematics. 

(3) The mathematical study of (2), including the logical structure. 

(4) Abstraction from the meaning of the expressions of (3), and of the 
logical operations, thus creating a second-order mathematical system 
(this is the system of the logicists). 

(5) The language of (4). 

(6) The mathematical study of (5) (a step made by Hilbert, but not by 
Peano and Russell). 

(7) Abstraction from the meaning of the expressions of (6), thus creating a 
mathematical system of the third order. 

(8) The language of (7) (this is the last stage considered by Hilbert). Etc. 

The well-known, and somewhat notorious issue of the principle of the 
excluded middle (PEM) has sidetracked many of Brouwer’s contemporaries 
into believing that intuitionism was primarily seeking a revision of the 
logical apparatus of mathematics. The title of Brouwer’s first full scale 
exposition (1918), may very well have fostered this misunderstanding. 

In the paper “The unreliability of the logical principles” (1908), Brouwer 
drew the inevitable consequence from his views on mathematics and refuted 
PEM using an undecided property of the decimals of 7. 

Brouwer himself never elaborated in writing the meaning of the logical 
constants, that step was eventually made by Heyting and Kolmogorov, but 
in his writings a “proofs as constructions”-concept is clearly implicit (cf. 
1907, chapter III; 1908). 


24 Introduction {Ch. 1 


In the late twenties Brouwer started to look for a way to exploit the 
peculiar properties inherent in the creative subject, the results appeared in 
print only after the second world war, (1948, 1949). The part of intuitionism 
based on the creative subject has remained rather marginal. 

Brouwer’s programme of reconstructing mathematics along intuitionistic 
lines found few adherents; a considerable part of the programme was 
carried out by his principal student A. Heyting, whose best-known contri- 
bution to intuitionism was the formalization of intuitionistic logic (and 
arithmetic) and the formulation of a natural semantics for it. This semantics 
is the BHK-interpretation of section 3; it is sometimes called the proof 
interpretation (Heyting 1934 and 1.5.3 below). 


4.4. Hermann Weyl (1885-1955) and predicativity. The tradition of Poin- 
caré and Russell, which laid the blame of inconsistencies and paradoxes on 
impredicative definitions, was taken up in 1918 by Weyl, who published a 
monograph, “Das Kontinuum” (1918). Like his predecessors, Weyl was out 
to avoid vicious circles; he rebuilt analysis using only a very restricted kind 
of sets and functions. Indeed, he only allowed arithmetically definable sets, 
i.e. sets defined by comprehension over formulas without bound set vari- 
ables. Weyl’s reconstruction was quite successful, a large portion of tradi- 
tional mathematics was recovered. 

Weyl’s philosophical position in this monograph (and in foundational 
matters in general) was that of Husserl’s Phenomenology. 

However, after learning of Brouwer’s intuitionistic foundation of 
mathematics he gave up his own programme and embraced intuitionism. In 
“Uber die neue Grundlagen Krise der Mathematik” (1921) he unequiv- 
ocally sided with Brouwer in the latter’s revolution, but later in life he 
dissociated himself from intuitionism and foundations in general. Lorenzen 
1955 can be viewed as a continuation of Weyl’s original programme. 

Both Weyl and Lorenzen were concerned with the actual revision of 
mathematical practice. Later work in predicativism was mainly of a meta- 
mathematical nature (cf. Feferman 1964, 1978, Kreisel 1960). So far predi- 
cative mathematics has remained a curiosity for logicians rather than a 
realistic alternative for the working mathematician. Recently, however, 
there has been a renewed interest in proof theoretically weak systems with 
strong expressive power, (Takeuti 1978, Friedman 1980, Friedman et al. 
1983), e.g. as in Friedman’s “reverse mathematics”. In fact, a surprisingly 
large body of mathematics can be developed in quite weak systems, much 
weaker than so-called predicative analysis. In many cases even conservative 
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extensions of first-order arithmetic allow the development of a good deal of 
traditional mathematics, in particular analysis. 


4.5. Finitism. Brouwer’s intuitionism contained unmistakable abstract no- 
tions; in particular sets (species), choice sequences, spreads are abstract 
objects in mathematics proper. The use of abstract notions has been 
recognized and criticized by a number of mathematicians, e.g. Kronecker, 
Skolem and Goodstein. They proposed a very narrow version of construc- 
tive mathematics, one in which only concrete combinatorial operations on 
(strictly) finite mathematical objects are allowed. The prototype of “finite 
mathematical object” is “natural number” and a good example of a 
“combinatorial operation” is provided by a “table of multiplication”. 

Skolem, already in 1919, systematically considered what kind of combi- 
natorial operation on the natural numbers can be introduced (1923). 

He developed a sizable part of primitive recursive arithmetic, i.e. he 
studied primitive recursive functions avant la lettre in a quantifier-free 
system. In fact, he worked in an informal mathematics, but he initiated a 
long tradition in the theory of primitive recursive arithmetic, cf. Goodstein 
(1957). 

Skolem explicitly adopted Kronecker’s foundational viewpoint: “a 
mathematical definition is a genuine definition if and only if it leads to the 
goal by means of a finite number of trials” (1923). As far as the founda- 
tions of mathematics are concerned, Kronecker is a forerunner of finitism. 

The restricted combinatorial view also turned up in Hilbert’s programme. 
Hilbert, in his improved formalist philosophy considered the finitistic, 
combinatorial mathematics as the truly meaningful part of mathematics. 
Mathematics dealing with the actual infinite, e.g. full arithmetic, was to be 
reduced to the finitary mathematics (arithmetic). This is the basic idea of 
Hilbert’s programme: ideal statements are to be granted a meaning in terms 
of finitary statements. Finitistic mathematics, of course, is completely 
constructive and hence a part of intuitionistic mathematics. Some early 
writers on the foundations of mathematics have confused finitistic mathe- 
matics and intuitionistic mathematics, e.g. Herbrand (1931). 

After Gédel’s demonstration that classical arithmetic could be reduced to 
intuitionistic arithmetic via his “negative translation”, (1933), it was clear 
that even intuitionistic arithmetic went beyond finitism. 


6 


4.6. Constructive recursive mathematics and Markov’s School. Another con- 
cept of constructivity, based on the theory of algorithms or recursive 
functions, has presented itself in the thirties. This development, however, 
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had a totally different philosophical motivation. At first primitive recursive 
functions and recursive functions were introduced as tools for meta- 
mathematical research. Gédel used (what we now call) primitive recursive 
functions in his proof of the incompleteness theorem (1931), and as pointed 
out above, Skolem had already in 1919 developed the basis of primitive 
recursive arithmetic (without introducing the class of primitive recursive 
functions explicitly) (Skolem 1923), Elaborating a suggestion of Herbrand, 
Gédel introduced the wider class of recursive functions in (1934). 

Along completely different lines Church introduced the A-calculus (1932), 
a theory that only used A-abstraction and application. In this theory the 
natural numbers could be represented and Kieene showed that the recursive 
functions were A-definable. 

Curry, in the meantime, had developed combinatory logic (1930), follow- 
ing the early work of Schdnfinckel. 

The equivalence of A-calculus and combinatory logic was established by 
Rosser, and the equivalence of the Herbrand—Gédel recursive functions 
with the A-definable ones was proved by Church (1936) and by Kleene 
(1936). 

An interesting conceptual characterization of algorithms was undertaken 
by Turing, who used abstract machines, now known as Turing machines 
(Turing 1937). Turing analyzed the intuitive concept of “mechanically 
computable by a human operator” and put forward a convincing argument 
that “computable” coincides with “computable by a Turing machine”. 
Church had also stated in 1936 a characterization of ““computable”, but in 
terms of recursiveness, he equated “computable” and “recursive” (1936). 
This identification of “effectively computable” and “recursive” has become 
known as Church’s Thesis. 

The theory of (partial) recursive functions has become a cornerstone of 
metamathematics. The reader will find more information on the historical 
details in the relevant chapters. 

Mathematics on the basis of Church’s Thesis was propagated by Markov, 
who founded a school of constructive mathematics in Moscow. Markov laid 
down the following principles for constructive mathematics: 

(1) the objects of constructive mathematics are words in various al- 
phabets. 

(2) abstraction through potential realizability is allowed, but not abstrac- 
tion through the actual infinite. 

(1) must be understood as including e.g. recursive functions, as given (or 
coded) by a set of equations or an index. An example of a potentially 
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realizable process is the addition of natural numbers, it does not involve the 
actually infinite set of natural numbers, but an algorithm that for any given 
n and m produces (a representation of) n + m. 

Markov formulated his constructive mathematics in terms of normal 
algorithms, also known as Markov algorithms (cf. Markov 1954A, Mendelson 
1964). We shall not use Markov’s formalism, but use the more familiar 
recursive functions. 

A consequence of Markov’s view is that number theoretic functions are 
recursive; that is to say the potentially realizable version of this “infinite” 
object must allow a representation by an index. In this sense Markov’s 
constructive mathematics can be viewed as a theory based on Church’s 
Thesis. 

On the one hand Markov is more strict than the intuitionists, e.g. he 
rejects choice objects, on the other hand he is more liberal. This is 
illustrated by his argument for justification of his principle of constructive 
choices (Markov 1962), nowadays called Markov’s Principle. (Markov 
formulated his principle in lectures in Leningrad in 1952/1953, it appeared 
in print in Markov 1954). 

In familiar terms, the principle states that if it is not the case that a 
Turing machine does not halt on a given input, then it does halt. Markov’s 
argument, in terms of Turing machines, is simply as follows; if it is 
impossible that the Turing machine will compute forever, then there is a 
clear algorithm for obtaining the output: just continue the process until it 
halts. Markov added: 

“TI know that intuitionists reject this method of argument, since they do 
not consider it ‘intuitively clear’. In connection with this I consider it 
necessary to make the following remarks. Firstly, my intuition finds this 
sufficiently clear. On the other hand, I can in no way agree to taking 
‘intuitively clear’ as a criterion for truth in mathematics, for this criterion 
would mean the complete triumph of subjectivism and would lead to a 
break with the understanding of science as a form of social activity. If I 
defend this means of argument here, it is not because I find it without error 
according to my intuition, but rather, firstly because I see no reasonable 
basis for rejecting it, and secondly, because arguments of this type make it 
possible to construct a constructive mathematics that is well able to serve 
contemporary natural science” (Markov 1962). 

As Markov indicates, his principle is not accepted by the constructivists 
at large; the principle has, however, found its place in the metamathematics 
of constructive mathematics (see e.g. sections 3.5, 4.5). 
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4.7. Bishop and constructive mathematics for the working mathematicians. 
The landscape of constructive mathematics suddenly changed in 1967 when 
Bishop (1928—1983) published his “Foundations of constructive analysis”. 
“This book, he said, had a threefold purpose: first, to present the construc- 
tive point of view; second, to show that the constructive program can 
succeed; third, to lay a foundation for further work. These immediate ends 
tend to an ultimate goal—to hasten the inevitable day when constructive 
mathematics will be the accepted norm”. Bishop’s message was that the real 
business of constructive mathematics is mathematics; it came at a time 
when constructivism was mostly tending its foundations, and it was pre- 
sented by a first-rate mathematician. 

In contrast to Brouwerian intuitionism (even after Heyting’s presenta- 
tion), Bishop’s nrathematics used the language of the working mathemati- 
cian. He attacked mathematics much in the spirit of Kronecker, be it that 
he introduced abstract objects in a more direct way. 

A few examples may help to illustrate Bishop’s position. 

In his view the only way to show that an object exists is to give a finite 
routine for finding it. 

Sets are introduced by him as “the totality of all mathematical objects 
constructed in accord with certain requirements” and they come with a 
given equivalence relation called “equality”. 

The basic higher type objects are operators, i.e. rules which assign 
elements of a set A to elements of a set B. 

Such a rule f must afford an explicit, finite mechanical reduction of the 
procedure for constructing f(a) to the procedure for constructing a. A 
function then, is an operator that respects equality. 

Bishop does not specify the notion of “rule”, but throughout rules could 
be taken to be recursive; that is, Bishop’s mathematics is compatible with 
Church’s thesis. 

The fundamental thesis of Bishop’s mathematics is that “all mathematics 
should have numerical meaning”, which is closer to Kronecker (or even 
Hilbert) than to Brouwer. Bishop foregoes the abstract objects of classical 
mathematics as well as those of the intuitionists. His mathematics rests on 
neutral ground, without ontological or idealistic objects and principles. 
Thus he cannot, like Brouwer, show that all real functions are continuous, 
but as a pragmatist he restricts his attention to continuous functions. 

The reader can see for himself how actual mathematics is done construc- 
tively in Bishop’s instructive book. Bishop had a number of followers, and 
at present a number of mathematicians are constructivizing mathematics 
along the same lines as Bishop. 
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The work of Bishop also raised a number of metamathematical questions, 
the main question being: which principles suffice to found Bishop’s 
mathematics on. There are to date two main approaches to the problem: a 
set theoretical one, proposed by Myhill and Friedman, and an “operations 
and classes” one, proposed by Feferman. The reader may consult chapter 9 
and Beeson'(1985). Bishop’s constructive mathematics was intended as the 
neutral part of constructive mathematics and as compatible with traditional 
classical mathematics. Metamathematical analysis has born this out, one 
can (in principle) extend his mathematics both in the direction of Brouwer’s 
intuitionism and in the direction of recursive mathematics @ Ja Markov. 


4.8. Esenin-Vol'pin’s ultra-finitism. In the opinion of a number of mathe- 
maticians Brouwer’s criticism of traditional mathematics stopped short of 
the potential infinity of mathematics — in particular, of the natural number 
sequence. Borel, for instance, declared that “the very large finite offers the 
same difficulties as the infinite”, Borel (1947), and van Dantzig (1956) posed 
the question “Is 10!” a finite number?” in the tradition of Mannoury 
(1909). Wittgenstein also shared the view that mathematics has no business 
with an idealized infinite set of natural numbers. The first to develop the 
strict finitist view beyond a collection of philosophical reflections was A.S. 
Esenin-Vol’pin. From 1959 onward he undertook the founding of mathe- 
matics on a strictly finite basis. He challenged the view that there is, up to 
isomorphism, a unique sequence of natural numbers. Taking a realistic view 
of natural numbers, there does not seem to be a good reason to assume that 
the successor operation can indefinitely be iterated. So there may be 
universes of natural numbers that do not have the same length. As a 
consequence of the limitations as to the natural numbers that can actually 
be reached, the universe(s) of natural numbers may not be closed under the 
common operations, such as multiplication or exponentiation. The strictly 
finitist view also has its consequences for logic; the derivations of A and 
A — B may still be within reach, but in order to apply modus ponens one 
might have to exceed the available natural numbers necessary for the length 
of the derivation of B. In a sense Esenin-Vol’pin’s program is even more 
restrictive than Hilbert’s finitism, therefore “ultra-finitism” is a more fitting 
name than the original “ultra-intuitionism”. 

The program indicated in Esenin-Vol’pin (1961, 1970, 1981) so far has 
not progressed beyond the initial stage. There are, however, certain re- 
searches into the possibility of introducing feasible numbers, e.g. by Parikh 
(1971), which tie up with considerations concerning complexity. Geiser 
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(1974) has applied the technique of non-standard models to ultra-finitist 
arithmetic. 


5. Notes 


5.1. Books on constructive mathematics and its metamathematics. Heyting 
(1956) is still a very readable introduction to intuitionistic mathematics. 
Brouwer’s own approach is illustrated in his Cambridge lectures (1981). 

Dummett (1977) is an introduction to intuitionism with only a little 
mathematics, but it contains a good deal of logic and metamathematics as 
well as philosophical discussion. : 

For a development of constructive mathematics in the style of E. Bishop 
see Bishop and Bridges (1985) and Bridges (1979). 

Aberth (1980) gives an introduction to recursive analysis; Kushner (1973) 
(recently translated) is an excellent introduction to constructive recursive 
mathematics in the spirit of A.A. Markov. 

Of the monographs on the metamathematics of constructive mathematics 
we mention Beeson (1985), Gabbay (1981) (intuitionistic predicate logic), 
Kleene and Vesley (1965) (intuitionistic analysis), Lambek and Scott (1986) 
(for connections between intuitionistic logic and category theory), and 
Troelstra (1973). Further information can be obtained from the notes at the 
end of each chapter, and the bibliography by Miiller (1987). 


5.2. Hypothetical reasoning. is also essential for the development of con- 
structive mathematics. 

Certain passages in Brouwer’s thesis might erroneously create the impres- 
sion that he did not want to consider hypothetical constructions, in particu- 
lar constructions which afterwards turn out to be impossible (1907, reprint, 
p. 22). In any case there is no doubt that in his later papers Brouwer 
accepted hypothetical reasoning, e.g. in Brouwer (1923) where he shows 
that the triple absurdity of a statement is equivalent to its absurdity (the 
logical law A < ———4- in our notation). 

The obvious intuitive objection is: how can we have a clear mental 
picture of a construction which afterwards turns out to be impossible? This 
objection has been raised at least twice in the history of intuitionism (cf. 
Troelstra 1983A). 

Freudenthal (1937) observes “We first ask ourselves how we should 
intuitionistically understand a proposition [“Satz” in the German original]. 
The most obvious explanation is to regard a proposition as the assertion of 
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a fact, verified by a proof [...]. This explanation must be treated with 
caution, since, if a proposition is only the assertion of a fact, then the 
separation between proof and proposition is not justified any longer”. Also, 
“If we recall that the correct formulation of a proposition is its proof, we 
immediately see that the construction of a proof of b from a proof of a, or 
the reduction of the solution of problem 6 to the solution of problem a is 
possible only if a has been shown to be correct — and thus implication is 
useless”. 

Heyting (1937) in his reply remarks “The following simple example 
shows that the problem a> b in certain cases can be solved without a 
solution for problem a being known. For a I take the problem ‘find in the 
sequence of decimals of 7 a sequence 0123456789’, for b the problem ‘find 
in the sequence of decimals of 7 a sequence 012345678’. Clearly b can be 
reduced to a by a very simple construction”. 

A late echo of this discussion is found in the attempt by Griss (see e.g. 
1946, 1955) to develop intuitionistic mathematics without negation. 


5.3. The BHK-interpretation. may be regarded as implicit in Brouwer’s 
writings (for example in his treatment of negation in Brouwer 1923). 
Heyting (1930C, 1931, 1934) made the interpretation explicit. In Heyting 
(1930C, 1931) a proposition is explained as “the expectation to find a 
certain condition fulfilled”; asserting a proposition is then the same as 
stating that a construction has been found which shows the condition to be 
fulfilled. 

Kolmogorov (1932) gave an interpretation of intuitionistic propositional 
logic as a calculus of problems. Thus for example, A — B represents the 
problem to solve problem B when a solution to problem A is known, or 
equivalently, to reduce the solution of B to the solution of A. Heyting 
(1934) contains the clause for implication as presented here; he also 
extended Kolmogorov’s interpretation to predicate logic. Later Heyting 
came to regard Kolmogorov’s interpretation as essentially the same as his 
own (1958), 

Kreisel (1962, 1965; for an exposition see also Troelstra 1969) proposed a 
version of the BHK-interpretation where the construction establishing an 
implication A — B consists of a pair (po, P;) where py is the construction 
transforming any proof of A into a proof of B, and p, a construction 
verifying this fact. In addition 2, := “p proves A”, and m( Po, p;, A, B) = 
“p, shows that, for all g, g proves A = po(q) proves B” are assumed to be 
decidable for arbitrary p, po, p,;, A, B. The clause for universal quantifica- 
tion is similarly modified. 
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Kreisel proposed this version in the hope of obtaining interesting 
new models for intuitionistic systems, but this hope was not fulfilled. 
N. Goodman developed a rather complicated theory of constructions based 
on Kreisel’s modified interpretation of the clauses for — and V (see e.g. 
1970), where the constructions were divided into levels, but it turned out 
that for his principal technical result — the conservativeness of the axiom of 
countable choice over intuitionistic arithmetic — these features of Goodman’s 
theory were not needed. 

Sundholm (1983) argued at length that the insight that “for all gq: 
q proves A => p,(q) proves B” should not itself be regarded as a mathe- 
matical object. Weinstein (1983) on the other hand regards the decidability 
of the proof predicate and the extra clauses in Kreisel’s variant of the 
BHK- interpretation as essential for a foundational justification of intuition- 
istic logic. 

As shown by Note A in Troelstra (1981), it is possible to give trivial 
models with decidable proof predicates 7, and decidable 7. See also the 
discussion in chapter 16. 


Exercises 


13.1. Show that the first two groups of schemas listed in section 3.4 are valid on the 
BHK-interpretation. 


1.3.2. A subfinite set is a subset of a finite set. Give weak counterexamples to the assertions: 
(a) each subfinite set is finitely indexed and (b) each finitely indexed set is subfinite (cf. 3.6). 


1.3.3. Give a weak counterexample to Kénig’s lemma in 2.4. Hint, Cf. the use of the predicates 
A and B in 5.2.12. 


1.3.4. Let D be an inhabited domain and let Ro, Ri, Rz,... be relations over D. Let A be 
any sentence constructed from primitive statements 1, R Ad ) and logical operations 
A,V,27,V,2. 

Associate with each A a set of “proofs” p[A] by induction on the number of logical 
operations in A such that: 4 
(i) p[R,(d)] is inhabited iff the finite sequence d of elements of D satisfies the relation R,, 

and otherwise empty; p[1] = 9; 

Gi) pLB A BY} = p[B] x p[B}; 
(iii) p[B Vv B’] = {(0, x): x © p[B]} U (1, x): x © p[BT}; 
(iv) p[B — B’} = p[B] > p[B’], the set of mappings from p[B] to p[B’; 
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(vy) piVxB(x)] = (f:¥x © D(f(x) © p[B(4@))}: 

(vi) p[AxB(x)] = {((d, x): x © p[B(d)]}. 

Show, using classical reasoning, that A < 3x(x © p[A)) for arbitrary assignments p of sets of 
proofs to R (d. ) satisfying the clauses (i)—(vi). This exercise shows that the BHK-interpretation 
in itself has no “explanatory power”: the possibility of recognizing a classically valid logical 
schema as being constructively unacceptable depends entirely on our interpretation of “con- 
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struction”, “function”, “operation”. 


CHAPTER 2 


LOGIC 


For a reader who is primarily interested in seeing some examples of 
constructive mathematics, very little logic is needed to understand almost 
all the material in chapters 5-8 (the parts of chapter 4 essential to these 
chapters also require little or no knowledge of formal logic). What is 
presupposed however, is that the reader has made himself familiar with the 
BHK-interpretation discussed in chapter 1, and has acquired some facility 
in reading mathematical statements written with the help of logical sym- 
bols, since we shall use logical notation almost continually. 


Description of the contents of the chapter. The first section is devoted to a 
system of natural deduction for intuitionistic predicate logic IQC. Familiar- 
ity with the basics of classical predicate logic, though not necessarily with 
systems of natural deduction, is presupposed. 

The second section extends the treatment to cover logic with possibly 
undefined terms in two variants (E-logic and E*-logic). 

It is possible to gain more familiarity with constructive logic than is to be 
obtained from chapter 1, without getting too involved in the hardware of 
the formalism, by studying 1.2—9, 2.1-5. 

Section 3 discusses the relations between classical and intuitionistic 
predicate logic, in particular the Gédel-Gentzen “negative translation” and 
its variants. 

Section 4 discusses systems for intuitionistic predicate logic of the 
axiomatic type (sometimes called “Hilbert-type” systems), which are fre- 
quently encountered in the literature and are often convenient in meta- 
mathematical work. 

Kripke’s semantics for IQC is introduced in section 5; section 6 contains 
a Henkin-type completeness proof. Kripke models provide an intuitively 
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very appealing semantics for IQC -not (obviously) related to the BHK- 
interpretation - which permits us to show quite easily that certain state- 
ments are not derivable in IQC. 

Section 7 finally discusses definitional extensions of systems obtained by 
adding definable predicates, total or partial functions. 


1. Natural deduction 


For the development of constructive mathematics as such we do not need a 
formal system of logic. Nevertheless it is convenient to have a set of logical 
rules available: thus we do not have to go back to the Brouwer—Heyting— 
Kolmogorov interpretation (1.3.1) each time we want to justify the use of a 
logical principle in our arguments. 

We have chosen natural deduction as our starting point, because it agrees 
well with informal reasoning. Moreover, its rules are motivated by the 
BHK-interpretation in a direct way. 


1.1. The language of predicate logic. The language ¥ of predicate logic is 
based on a (countably) infinite supply of variables vp, v,,... (as metavari- 
ables for variables we shall use x, y,z in this chapter), n-ary predicate 
(relation) symbols R5, R7,... (for all n © N; metavariables R, R’,...) and 
n-ary function symbols f,’, f/",... (for all n © N; metavariables /, g). The 
zero-place function symbols are also called (individual) constants (metavari- 
ables: c, d). 

This language can easily be generalized in two directions: (i) dropping 
the restriction to countably many symbols of each kind, and (ii) extension to 
a many-sorted language, with a collection of sorts I, and variables 


Up, V4, 04,... for each sort i€ J, a supply of n-ary relation symbols 
Rig’, k EN for each sequence ip,..., i, , of sorts in J, a supply of 
function symbols fjo:-+!"-»!, k € N for each ig,..., 4,1, 74 from I. 


The language of so-called higher-order logic will be treated as a special 
instance of a many-sorted language. 

Terms, and formulas on the basis of A, V, >, 1 ,V,4 as primitives, 
are defined as usual. ¢, s will be used as metavariables for terms, A, B,C, D 
as metavariables for formulas. P,Q usually denote atomic formulas. 
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The language of propositional logic Y, has only 0-place relation sym- 
bols; these are called propositional variables (metavariables P, Q). 


1.2. Examples of natural deduction rules. Suppose we have established B, 
repeatedly appealing to assumption A. This means that we have shown how 
to construct a proof of B from a hypothetical proof of A; thus, on the 
BHK-interpretation this means that we have established the implication 
A — B. In this conclusion A is not an assumption anymore (A has been 
cancelled, discharged, or eliminated as an assumption). Schematically we 
can render this as 


a 


A->B 


The crossing-out of A indicates that A has been eliminated as an assump- 
tion in the final conclusion. The type of inference just described is called 
implication introduction (— I) since in the final conclusion an implication 
sign is introduced. 

There is also an implication elimination rule (— E): if we have shown 
A — B and A, we can also prove B, since a proof of A — B must provide a 
construction to transform a proof of A into a proof of B. Schematically: 


A>B A 
B 


As an example of a deduction by means of — I, — E we show how to 
deduce A — C from assumptions A > B, B > C: 


A->B A 
——— > E 
Boc B 

-E 


— I (A cancelled) 


A-C 


At each horizontal line we have indicated the rule which has been applied. 
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For conjunction, we can similarly justify an introduction rule AJ and 
two (left- and right-) elimination rules, AE, and AE,: 


A B AAB AAB 
Serger AE, 


An example of a deduction based on these rules is 


ANAB A AB 
Al 


I (A A B cancelled) 


BAA 
ANB>OBAA 


Finally we consider a quantifier rule, namely all-introduction VI. Schemati- 
cally VI can be rendered as 


A(y) 
VxA(x) 


That is, if we have derived A(y) for a completely arbitrary y, we may in 
fact infer VxAx, since our derivation of A( y) serves as a schema which can 
be applied to any particular object in the range of the variable y—so we 
have the construction required by the BHK-interpretation for a proof of 
VxA(x). In an application of this rule, the proof of A(y) should not 
depend on other assumptions containing y, because then we could not 
regard y as being completely arbitrary. 

That without such a restriction we indeed arrive at false conclusions 
becomes clear by the following example of a “deduction” in which the 
condition is violated: 


A(x) 


wa AON — 1 (A(x) is cancelled) 

A(x) > VyA(y) 

The deductions in our examples above may be regarded as labelled trees, 
the label attached to a node consisting of a statement, together with the 
name of the rule which has been applied to obtain the statement. For 
reasons of readability we have put the name of the rule next to the 


Sect. 1] Natural deduction 39 


horizontal lines instead of next to the formula. Thus the trees in our 
examples of deductions (of A > C from A > B, B-> C andof AA B- 
B A A) are 


1.3. Inductive definition of logical deductions. We are now ready to give a 
more formal definition of deduction, open assumption, and cancelled 
(= discharged = eliminated) assumption. The definition will take the form 
of a simultaneous definition by recursion on the height of the deduction 
trees (or, equivalently, the form of a simultaneous inductive definition). 

We shall use 9, possibly sub- or superscripted for arbitrary deductions. 
We write 


B 
B 


to indicate that B is the conclusion of J (so the occurrence B is part of Z 
itself). We use [ A] for a (possibly empty) set of occurrences of a formula A 
in a deduction; thus 


[4] 
2) 
B 


is a deduction 2 with conclusion B, containing a set [A] of occurrences of 
A, the elements of which are used as assumptions in 2. As a rule (cf. the 
“CDC” in 1.4(ii) below) we assume that [A] contains al] open assumptions 
of the form A in &. 


DEFINITION (of deduction, open and cancelled assumptions of a deduction). 
Basis. The single-node tree with label A (i.e. a single occurrence of A) is 
a deduction from the open assumption A; there are no cancelled assump- 
tions. 
Inductive step. Now let 2,, 2,, B, be deductions. A deduction D may 
be constructed according to one of the rules below. Some of these rules are 
subject to restrictions to be specified afterwards. 
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For 1 we have the intuitionistic absurdity rule 
2, 
ae 
1;— 
A 


For the other logical operators we have introduction and elimination rules. 


Introduction rules (I-rules) Elimination rules (E-rules) 


2, 2, QD, 2, 
A B AAB AAB 
Al AT Eira Ei 3 
[A] 
2D, 2D, 2, 
B ASB A 
14S 7 
[A] [B] 
2, 2, 9, 9, Y, 
A B AVB C C 
WAVE ave. - C 
DQ, 2, 
VI A VxA 
VyA[x/y] A[x/t] 
[A] 
a, a, 2, 
A[x/t] AyA[x/y] C 
al 3xA E Cc 


Open and cancelled assumptions are given by the following stipulations 

(i) In — I-applications all open assumptions of the form A in Q,, 
indicated by the set [A] are cancelled; in an application of V E the sets [A] 
in 2, and [B] in 2, are cancelled; in JE the set [A] in @, is cancelled. 

(ii) If B, is a deduction of a premiss of the last rule application in 2, 
then the assumptions open in 9, remain open in 9 except as specified by 
(i). Thus assumptions which are not cancelled are open. 

Instead of cancelled we also use discharged or eliminated. 

The rules for the quantifiers are subject to the following restrictions. 

(iii) In VE and AI ¢ must be free for x in A. 
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(iv) In VI 2, must not contain open assumptions containing x free, and 
y =x or y is not free in A. In JE Y, must not contain open assumptions 
containing x free except the set [A]; x not free in C, y = x or y not free 
in A. 

If A is among the open assumptions of a deduction 9 with conclusion 
B, the conclusion B in @ is said to depend on A in Q. From now on we 
regard “assumption of 2” and “open assumption of 2” as synonymous. 

O 


1.4. REMARKS. 

(i) We have assumed that the names of the rules are attached to the nodes 
of the tree. In most cases however, the form of the statements itself already 
determines the rule being applied, so that we can often omit the names of 
the rules when we are actually exhibiting deduction trees. Nevertheless, 
adding names of rules next to the horizontal lines marking the applications 
of the rules often facilitates following the proof. 

(ii) Another device helping us to grasp the structure of a given deduction 
is to number the (occurrences of) assumptions at top nodes which are being 
discharged lower down in the tree, and to repeat the number near the node 
where the discharge takes place, either at the line marking the application 
of the rule causing the discharge, or placed next to the conclusion of this 
application. Assumptions which are discharged simultaneously may be 
given the same number. 

Strictly speaking, the numbering of discharged assumptions is redundant 
information: by our definition as given above any assumption is discharged, 
if at all, at the earliest opportunity. From a given deduction tree we can 
therefore unambiguously read off, even without numbering, the node where 
an assumption is discharged (if at all). This discharge convention we call 
the crude discharge convention (CDC); we shall stick to it throughout except 
in section 10.8. 

(ill) The rules VI and JE may be simplified to 


[A] 

DB, 2, 2, 

A axA__C 
Mig: aE Se 


where in VI A does not depend on open assumptions in 9, containing x 
free, and in JE C does not contain x free and the deduction 2, of C does 
not contain open assumptions containing x except [A]. The more general 
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form of the rules given before can be obtained from these special cases: 


2, [A] (2) 
A (1) A[x/y] 2, 
VxA 2, 4xA Cc 
A[x/y ] AyA[x/y] C(2) 
VyA[x/ y] (1) C 


We find it more convenient however to combine VI, JE with the possibility 
of renaming bound variables, since we regard formulas differing only in the 
names of their bound variables as isomorphic. O 


The reader may now check for himself that all the rules given above are 
justified by the BHK-interpretation. The rules given are adequate in a 
practical sense: they cover all logical laws needed in a formalization of 
intuitionistic or constructive mathematical practice. It is natural to ask 
whether the rules are also complete, i.e. whether they permit us to derive all 
(intuitionistically or constructively) valid theorems. This delicate problem 
will be discussed in chapter 13, more particularly in sections 2 and 3 of that 
chapter. 


1.5. EXAMPLES. 


Example (a). We have argued before, in 1.3.3, that —(A V —A) should 
hold for our reading of ~, V; here is a formal deduction, corresponding to 
the informal argument we gave before 


(1) A 


VI 
2) -(AV-—A) AV—A 
se 
fae | 
(1) =A VI 
(2) -(A V AA) AVA 
FF 
1 
I 


ene Gey | eae 


(recall that ,A = A > 1). 
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Example (b). 


(1) VWxAx 
(2) sAy Ay 
dae I 
= 
(3) =-=AVxAx GQ) AVxAx 
SE 
ae 
(2) —Ay 
VxAAx 
(3) —5 VWxAx - VxAn-=Ax 


-E 


VI 


>I 


Confronted with the problem of constructing a deduction tree for a given 
statement, the best tactic is often to start from below, and to assume, if 
possible, that the final step was an instance of an I-rule. This is then 
repeated as long as possible. Thus, in attempting to construct a deduction 
for ==VxAx —- Vx Ax, we reduce the problem to the construction of a 
deduction for 1 from assumptions —4VxAx,-Ay. Now we attempt to 
apply E-rules to the assumptions. In order to apply an E-rule to —=VxAx, 
we need {VxAx, so we attempt to find a deduction 


77) 
AVxXAx } 


again assuming the conclusion to be obtained from - I, we look for a 
deduction &’ such that 


[AAVxAx][4Ayp][WxAx] 
Gg’ 
Ao 
=a VxAx 


and &’ is easily constructed as 
VxAx 
aAy Ay 
L 
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Example (c). 


(i) A AA (2) 
aa 


3) and 54 2) and gy _B BQ) 


+ i 
(1) AA (1) aR 
(3) ~174 >—A (2)B> —B 


If in the second derivation we take A for B, we see that ,A —- —=—4; 
hence combining these results we find 


aA =A. 


Example (d). The tactics described under example (b) are not always so 
straightforward in their application; consider e.g. the following deduction 


()n4. A (2) 


By (3) 1 
(4) A> 3xBx A (1) A= By By 
IxBx dx(A > Bx) A- By (2) 
(5) AVA 3x(A > Bx) (3) 3x(A > Bx) 


4x(A > Bx) (1) 
(A > 4xBx) ~ 4x(A > Bx) (4) 
AV «A >[(A > 4xBx) > 4Ix(A > Bx)] (5) 


A straightforward attempt to build from below suggests us to obtain 
“4Ax(A > Bx)” from “A > By”, but then we would be stuck; so we must 
reckon with the possibility, while breaking down the conclusion, that V E- 
or JE-applications intervene when going upwards. 
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Example (e). 


(2) nA A (1) 


ai 
B 
(4) ~4>B) A>B() (3) B 
L (4) (A> B) AB 
28 +B (3) 


4 


Some other logical laws which we shall need in the future are 
(f) (4 >B)> (48> QA) 

(g) —--(4A > B) > (A> 8) 

(h) =i(A AB) e544 A AB. 


We shall leave their proof as an exercise. 

Note that all logical deductions are schematic in character: replacing the 
letters A, B etc. by compound statements throughout transforms correct 
proofs into correct proofs. 

In practice one uses many short cuts in giving derivations; any logical 
law may be used as an assumption which is not discharged but does not 
count as a premiss for the conclusions; similarly we can use deductions 
from assumptions as additional (derived) rules, e.g. the derivation of 
A — C from A — B, B > C (cf. 1.2) permits the introduction of a derived 
rule such as 
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As an example of a deduction involving derived rules, consider 


y 2, 
(A > 9B) > (525B 3 5A) (495 B 2 3A) > (A 8B) pp 
2, 


‘ (A - -B) > (.54 - 8) a7 B<> —B 


(A > =B) ~ (AA > iB) 


R 


where 2,, G, exist by (f), D, by (c), and where rule R’ is a schema of 
substitution for logically equivalent statements: 


Combining this conclusion with (e), (g) we find 


1.6. Classical logic is obtained by strengthening the absurdity rule to the 
classical absurdity rule 


[4] 


1 
= 


The assumptions indicated by [—A] are being discharged. Observe that by 
ol 

{A] 

ae 

aA 


>I 


and that therefore the adoption of (1 .) is equivalent to having proposi- 
tions of the form —A — A as axioms: 


[—A] 
2 
a 


AS] 


A ae 
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Classically, we can drop V ,J and their rules, by defining them as 

AV B=—(A A-—B), 

3xA(x) = “Wx—A(x). 


We leave it as an exercise to show that the rules for V,3 can indeed be 
obtained as derived rules from the rules for A, 7, V. 


1.7. Deductions from axioms. So far, we have only considered purely logical 
deductions. In mathematical theories there are also non-logical axioms. For 
example, in proving identities for abelian groups, we make use of the group 
axioms, such as commutativity: 


VxVy(xt+y=yt+x). (1) 
Such axioms are often presented in open form as 
x+yaytx; (2) 


the universal quantifiers are tacitly omitted. Another way to look at this 
open form is as a schema: we may substitute ¢ and s for x and y or 
“deduce” the form (1) from it. 

There is a completely uniform way of accommodating such mathematical 
axioms in our natural deductions: we permit them to appear at the top 
nodes of our deductions. However, if we permit the use of open forms such 
as (2), we must strictly distinguish betweem axioms and open assumptions: 
an axiom appearing at a top node does not count as an open assumption. In 
particular, in the restrictions on VI and WE, we have to exclude only those 
variables which appear free in assumptions. 

VI permits us to derive the universal closure of an axiom from the axiom: 
applying VI twice to (2) leads to (1); and from the universal closure by VE 
we obtain all special instances, such as ¢ + s = s + ¢ in our example. 

Another way of achieving the same effect is to treat axioms as “rules with 
an empty premiss”; in our proof trees this can be indicated for example as 


g 
x+y=ytx apie xty=ytx 
Now we do not have to distinguish between assumptions and axioms at top 
nodes; instead, we have a new type of rule. 
If we agree to present axioms always in their universally closed form, 
there is no need to distinguish between axioms and assumptions any more: 
we simply permit axioms to appear as (undischarged) assumptions. 
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The choice of presentation is a matter of taste and/or convenience. 
Proofs in natural deduction style in mathematical theories are often pre- 
sented in linear form, since tree presentations become unwieldy for com- 
plicated arguments; for an example of a proof in linear style see 3.2.4. 


1.8. Logic with equality. In classical logic, equality is mostly regarded as 
part of the logic. In constructive mathematics, the equality of the language 
is often interpreted as a defined notion, not intuitively given as a primitive, 
and then it is more natural to think of equality as mathematical rather than 
logical. As axioms we can take (the universal closures of) reflexivity 


REFL X=X 
and the replacement schema 
REPL A(x) \x=y—>A(y). 


[f we take in this schema A(x) := (x = 1), A(t) is true by REFL; so from 
A(t) A t=s—A(s) we see that t=s—>s=1. Also, taking A(x) := 
(x =r), we find 


f=rAt=s-s=r. 


Another way of accommodating equality in a natural deduction context is 
to have an axiom x = x, or a zero-premiss rule 


REFL t=t 
and a replacement rule 


A[x/t] t=s 
A[x/s] 


with as a special case the transitivity rule 


REPL 


TRANS pot =s 
r=s 

1.9. DEFINITION. We shall use IPC as the designation for the system of 

intuitionistic propositional logic, and IQC for intuitionistic predicate logic, 

with or without equality, with or without function symbols and constants. 

Classical propositional logic CPC and classical predicate logic CQC are 

obtained by adding the rule 1, to the corresponding intuitionistic systems. 
Oo 
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We shail encounter in the sequel other formalizations (that is to say, 
different formalisms generating the same set of theorems) for IPC, CPC, 
IQC and CQC, but we shall not distinguish between these formalisms 
unless the structure of deductions is under discussion (as for example in 
section 4 below, and in chapter 10), or when the equivalence of these 
systems has to be proved. For example, we occasionally use N-IPC, N-IQC 
to distinguish the natural deduction versions of IPC and IQC described 
above from other versions under discussion. 


1.10. One-variable formulas in IPC. As an illustration of the complexity of 
IPC, compared to classical propositional logic CPC, we consider the 
one-variable formulas in IPC, generated from a single proposition letter P 
by means of the logical operations — , A, V and 1 . Modulo provable 
equivalence all such formulas occur in a sequence (A,,(P)), emu fw) given 
by 

Ao(P) = 1, A(P)=P, A,(P)=-P, 


Aonti(P) = Agg_(P) V Aa, (P), 
A on42(P) = A2,(P) ee Aon-i(P), 
A,(P) = P->P. 


Ao 


Implications hold between the A,, as indicated in the diagram above (A, 
below A, connected by edges means that A; > A, is derivable in IPC). The 
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diagram is called the Rieger—Nishimura lattice, after Rieger (1949) who first 
investigated the one-variable formulas, and Nishimura (1960) who redis- 
covered the structure. See also E2.1.6 and subsection 13.5.5. 


2. Logic with existence predicate 


2.1. So far we have tacitly assumed that the expressions (terms) ¢ which 
appear in rules such as VE, JI always denote some individual of the domain 
over which the variables are supposed to run. For example, in discussing 
arithmetic we can freely combine 0 and variables by +,- to obtain new 
expressions for natural numbers, since + and - are total functions on 
N XN. 

However, in mathematics one frequently uses partial functions such as 
x + x7! on the real numbers, which is not defined for x = 0. On the basis 
of classical logic we can avoid the consideration of partial functions by an 
ad hoc stipulation, putting e.g. 0~' = 0 in the case of the inverse function. 
But, constructively, this does not give us a total function. Here a possible 
solution might be to introduce a new sort of variables ranging over 
R*:= {x ©R: |x| > 0}, the set of reals positively distinct (apart) from 
zero, and to consider ~! only on R*. If we consider many distinct partial 
functions from R to R, this type of solution would require us to introduce 
many additional sorts. This is often impractical and inelegant. 

If, on the other hand, we admit in our logic partially defined terms, we 
have to revise the quantifier rules; thus for example the instance of the 
VE-rule: infer A[x/t~*] from VxA, must be subject to a stipulation that 17? 
exists, otherwise we take the conclusion to be either meaningless or false. 


2.2. E-logic. We shall now indicate how to adapt our logic so as to take 
into account the possibility of undefined terms, i.e. terms which do not 
always “denote”. The contents of this section may be skipped until needed. 
In the “mathematical” chapters 5-8, we occasionally need 2.2-4; the rest 
of this section is relevant to metamathematical investigations only. 

As part of the logic we assume a special predicate constant E, where Et 
has the intuitive interpretation “t exists”. E is called the existence predicate. 
Variables will be treated as purely schematic, that is to say, any term may 
be substituted for a free variable. Quantifiers on the other hand are 
supposed to range over existing objects only. This leads to the following 
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modified quantifier rules: 


[Ex] 
of OA VxA Et 
Wl WyAlx77] YES AL 
[AILEx] 
Alx/t] Et AyA[x/y] C 
(a | er 
ey 3E a 


The variable conditions are straightforward modifications of the condi- 
tions in 1,3, 

In addition, we have to express the schematic role of free variables. This 
can be done by adding a substitution rule 


SOE 
A[x/t] 


provided the deduction of A does not depend on assumptions containing x 
free. 

It is easy to see that for purely logical deductions (i.e. without axioms), or 
deductions from sets of sentences, we do not need the substitution rule: it is 
derivable. 

An open axiom cannot, in general, be replaced by its closure: VxA(x) 
says something about “existing x” only, and A(t) can be obtained from 
VxA(x) only under the additional assumption Et; on the other hand, the 
rule SUB permits us to derive A(t) from A(x) without assuming Et. A 
typical example is the “theory of global existence”, axiomatized by the 
single open axiom Ex. In particular, this axiom implies that all functions 
are total. 


SUB 


2.3. E*-logic. In this variant we assume the free variables to range over 
existing objects, though terms need not denote anything. 

Now we have axioms Ex for all variables x, and quantifier rules 
VI, 3E, 31£, VE®. A substitution rule ought to have two premises 


A Et 
A[x/t] 
but this rule is in fact derivable from VI, followed by VE¥. 


On the other hand, where formerly we had an axiom A(x), which then 
by the substitution rule gave A(7), we now have to treat all A(t) as axioms. 
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As to the exact relationship between this and the preceding version, see 
exercise 2.2.4. 


2.4. E-logic and E *-logic with equality. We now add equality to E-logic and 
E*-logic, with the intuitive interpretation: ¢ = s iff ¢ and s both exist and 
are equal. Thus we have an axiom schema 
t=1© Et 
(and so E could be defined in terms of =). Defining 
t=s:=EtV Es -t=s, 
we have a replacement schema 
A(t)At=s—>A(s). 


The intuitive meaning of = is: ¢ is defined iff s is defined, and if they are 
both defined, they are equal. 

Strictly speaking, this would suffice as a basis for logic with E and =. It 
will be convenient, however, to make an additional assumption of strictness 
for all primitive functions and relations, i.e. 


Ef (to, .++5t,-1) > Et; 
Rog sas ty) SE 


for each n-ary function symbol f of ¥, and each n-ary relation symbol R 
of &. For functions STR expresses the fact that the function value can exist 
only if all the arguments exist. As a result of STR, if a compound term 
refers to an existing object, then all its subterms must refer to existing 
objects. STR imposes a similar restriction on the basic relations of the 
language. But it should be noted that the property of strictness, by its very 
nature, does not extend to all logically compound predicates; that is to say, 
in general we do not have A(t) > Et: one only has to take Ex for A(x)! 

Once we have adopted STR, the meaning of Et and t = s for compound 
terms can be explained in terms of equality between less complex terms, 
since the following are derivable (exercise) 


Ete t=10 34x(t=x), 


STR | (i<n) 


t=s eo 4x(t=xAs=x), (1) 


ft=xeay(y=TA fy =x). 


2.5. NOTATION. IQCE is intuitionistic logic with equality and E based on 
the propositional rules of 1.3, the quantifier rules of 2.2, equality axioms 
and STR as in 2.4. Finally, IQCE* is E*-logic (cf. 2.3). O 
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2.6. In the presence of STR, we can simplify the equality axioms: 


PROPOSITION. The replacement schema 

A(t)At=s—7A(s) 
is derivable on the basis of the other axioms and rules of IQCE or IQCE* 
from the special cases: 


RiAt=5-— R35, (1) 
E( fs) At=5- ft= fs. (2) 
PROOF. By induction on A. We will only treat the atomic case, the other 


cases are left to the reader. For simplicity we shall only consider one-place 
predicates and functions. For a predicate P: 


t=sA Pte (by definition) 
(ErV Es>t=s)APt=> (by propositional logic) 
(Et>t=s)APt> (by STR) 
(Et>t=s)ANPtANEt=> (by propositional logic) 
t=s A Pt = Ps (by assumption). 
Similarly for function symbols: 
t=sAEtAE(ft) > ft=fs (by(1)), 
hence since by STR E( ft) > Et, also 
E(ft)At=s-ft=fs, 


and therefore t= 5 —> (E(ft) > ft = fs); similarly t= 5s —> (E(fs) > 
ft=fs),sot=s— (E(ft) V E(fs) > ft = fs), and thus t= s > ft = fs. 
O 


2.7. Axiomatizing IQCE with = as a primitive. We now consider E-logic 
with VIE, a1®, VEF, JEF and a primitive binary predicate = satisfying 
x=y A A(x) > A(y), 
Vi(x=zeoye=z)>x=y. 


We call the resulting system IQCE*. 


PROPOSITION. Define in IQCE* 
t=si=EtNEsAt=s. 
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Then IQCE* becomes a definitional extension of IQCE; conversely, defining 
in IQCE 
t=s:=EtV Esot=s, 


IQCE becomes a (definitional) extension of 1QCE*. The systems are equiv- 
alent in the sense that 


IQCEb t=s0(EtA Es A (EtV Es>t=s)), 
IQCE* + t=s 0 (EtvV Es > (Et A EsAt=s)). 


We can also formulate this by saying that IQCE and IQCE* have a common 
definitional extension. 


PROOF. We consider IQCE*. From Vz(x = z@ y =z) > x = y we obtain 
x = x; hence by replacement x =~yAx=x>y=x,SOX=yry=x, 
and alsox =~yAy=z—>x =z (again by replacement). 

Let ¢ = s; if Et, then also Es and vice versa, so (Et V Es) \t=s-> 
Et \ Es A t = s, hence 


t=s—>(Etv Es>(EtA Es At=s)). 


Conversely, if Et V Es > t = s, and we assume Ez, t = z then Et. There- 
fore t = s, and hence s ~ z and vice versa, so Vz(t = z@s =z), and 
therefore also f = s. 

We leave the argument for IQCE as an exercise. O 


2.8. The relation between IQCE and IQC. The following proposition, 
proofs of which will be given in 10.5.6 and 13.6.16 settles the relation 
between IQCE on the one hand and IQC with special distinguished 
predicates E and = on the other hand. 


PROPOSITION. Let IQCE be as above and let IQC* be 1QC with the equality 
axioms for = and a predicate E relative to which STR holds. We define a 
mapping © from the formulae of IQCE to IQC* by 


E 
(Rigsscte ca) Fo Rey 6b Gs 


(t9=t,)"° =EtA Et Aty=t; 
(Ac B)” = AFo BF fore E{A,V, >}; 
(wxA)” = Wx[ Ex > A*]; 


(axA)” ‘= 3x[ Ex A A?]. 
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If = appears as a primitive, we can add a clause 
(to = t))" = Ety V Et, > ty = hy. 

Then 
IQCEF A IQC*F A’. O 


2.9. Descriptions. We can still further strengthen our logic (either E-logic or 
E*-logic) by adding a descriptor or description operator Ix.A(x), “the 
unique x such that A(x)”. If there is no unique x such that A(x), Ix. A(x) 
is undefined; thus in particular Ix. A(x) is undefined if there are several x 
such that A(x). 

The intended interpretation of Ix. A(x) is adequately captured by the 
description axiom 


DESCR  Vy(y = Ix. A(x) @ Vx(A(x) Ox =y)). 


For example, E(/x.x = x) iff the range of the variable x consists of a 
single element, and ~E(Ix.x # x). 

In the case of E-logic the addition of a descriptor with axiom DESCR to 
open theories may result in inconsistency: let T be a theory (a fragment of 
first order arithmetic for example) in which we can show A(x, y) A 
A(x, y’) > y =y’ and =VxdyA(x, y) for a suitable A, and such that 
T U {Ex} is consistent (in other words, T is compatible with “global 
existence”); then T’ = T U {Ex,DESCR} is inconsistent, at least if we 
interpret SUB in the new theory T’ as applying to all terms of T’ and not 
just to the terms of T. 

Warning. In the case of CQC it is a wellknown fact that the addition of 
“Skolem-functions” (i.e. choice functions) is conservative, but this is not so 
in the case of IQC; see 5.13, proposition. 

The essential properties of Ix.A(x) in connection with E are given by 
the following proposition 


2.10. PROPOSITION. Let y € FV(A). Then 
(i) E(Ix. A(x)) @ AyVx[A(x) @ x = y]; 
(ii) E(x. A(x)) — ACUIx. A(x)). 
Proor. (i) Apply DESCR with Ix. A(x) for y, then 
E(Ix.A(x)) @ Wx[A(x) © x = Ix.A(x)] 
> 4yVx[A(x) ox =y]. (1) 
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Conversely if JyWx[A(x) @ x = y], for some y Vx[A(x) @ x =y], Ey 
and with DESCR y = Ix. A(x), therefore E( Ix. A(x)). 

(ii) Apply (1): EUIx. A(x)) > (A(x. A(x)) @ Ix. A(x) = Ix. A(x)), and 
thus (ii) follows. O 


As we shall see in section 7, under suitable conditions on the theories 
considered, the addition of a description operator is conservative, in fact a 
definitional extension. 


2.11. REMARK. It should be observed that the introduction of a description 
operator satisfying DESCR has the same effect as introducing function 
symbols for definable partial functions. For suppose 


k A(X, y) A A(X) oy sy’, 
then we may regard Iy. A(X, y) as a partial function in X, satisfying 

t AyA(X, y) o A(X, Iy. A(X, y)) A E(Iy.A(%, y)). 
Conversely, suppose that, whenever - A(x, y) A A(x, y’) > y = y’, we 
may introduce a constant p, for a partial function such that 

b AyA(X, y) @ A(X, px) A Eq, x. 
Then we are also able to construct a term with the properties of Iy. B(y), 
for any B(y): let B(x, y) be given and consider 

A(x, y) = Vz(B(X,z) ey =z). 


Then clearly + A(x, y) A A(X, y’) > y =y’. Finally, take ,(X) for 
Ty. B(x, y). We leave the verification of details to the reader. O 


3. Relationships between classical and intuitionistic logic 


In this section we shall show how CQC can be embedded into IQC by 
means of the various “negative translations”. Furthermore, we shall obtain 
a number of conservation results of CQC over IQC, i.e. results of the form: 
for ali A belonging to some syntactically defined class ¥ of formulas 


CQCtL A> IQCE A. 


The basic facts established in 3.2-3.8 will be frequently used later in the 
book. The remaining part deals with refinements, to which we shall refer 
only occasionally later on. 
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3.1. DEFINITION. A formula A in a first-order language is said to be in the 
negative fragment (or “A is negative”, not to be confused with “A is a 
negation”) if prime formulas P occur only negated (i.e. in a context 
P—1)in A, and A does not contain V or 3. O 


3.2. DEFINITION. Let MQC, the so-called minimal predicate logic, be the 
system IQC with the rule 1; left out. 

We use as abbreviations -,,+,,,; for derivability in CQC, MQC, 
IQC respectively. O 


N.B. In MQC 1 is treated as an arbitrary proposition letter. 


3.3. LEMMA. For A negative MQC + A @ ——A. 


Proor. As seen by inspection of 1.5 and E2.1.1, the following are all 
provable in MQC 

——(A N B) >A A-=-=B; 

—-(A = B) => Gaia a —~B) had (A = —B); 
The proof now proceeds by induction on the complexity of A: A has one of 
the forms =P (P prime), B AC, B > C, WxB. Consider e.g. the case 
=(B — C); the second implication holds by the induction hypothesis. We 
leave the other cases to the reader. O 


3.4. DEFINITION. For all formulas of predicate logic the (Gédel-Gentzen-) 
negative translation ® is defined inductively by 


(i) P& := —P for P prime; 
(ii) (A A B)® == AEA BS; 

(iii) (A > B)® = A® > BE; 

(iv) (WxA)® = VxA8; 

(v) (A V B)® == -(4A8 A B®); 
(vi) (AxA)® == =Vx4®% O 


REMARK. An inessential variant is obtained if (i) is replaced by 
(i)’ P® := —4P for P prime, P#1; 12=1. 
Also we may systematically replace occurrences of ——— by =. O 
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3.5. THEOREM. For all A 
@ +, 4 A% 

(ii) Pr, Ao TR, AS, 
where F8 = { B®: BET}. 


ProoF. It is a matter of routine to establish the equivalence of A and A® in 
cac. 

The proof of (ii) from right to left proceeds by induction on the height of 
deduction trees in CQC. 

Each application of a rule for A, > ,V can be replaced, under the 
translation, by a corresponding instance of the same rule; for example, if 
we have already obtained deductions 


DEBE 
A& Bs, 
corresponding to 
QR DB 
A B, 
then 
2, DB. : DBE DB} 
ans 3 translated into ABA BE: 
As for 1L,, suppose that the deduction 
[.4*] 
ge 
aa 
has already been constructed, and consider the following instance of 1, 
[=A] 
RQ 
t 
A 


Then the proof tree below is a correct deduction of A® in MQC: 
(1) [4°] 
Ge 
1 go 


(1) A848 > AP 
A® 
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where 2’ is the derivation in MQC of —— A® — A® which exists by lemma 
3.3. 

As for the rules for V,3, it is well-known that in CQC V and 3 are 
explicitly definable by A V B= (4A A -=B), 4xA = AVx—A, in the 
sense that the rules for V,3 can be obtained as derived rules for these 
defined connectives. ® replaces V,4 by their classical definition, and the 
rules for the other operations have already been shown to be valid under ®. 

The proof of (ii) from left to right follows from the first part of the 
theorem. O 


3.6. COROLLARY. For negative A, CQC 1+ A ifIQCF A. O 


This corollary shows that, in a sense, CQC is contained in IQC, since each 
formula of CQC is equivalent to a negative one. 


3.7. Other versions of the negative translation. One of the best known 
variants is Kolmogorov’s negative translation *. A* is obtained by simulta- 
neously inserting double negations in front of all subformulas of A (includ- 
ing A itself). Inductively we may define * by 


P* := P for P prime, L*:= 1 ; 
(Ac B)* = (Ako B¥) fore €{A,V, >}; 
(QxA)* = Ox A* for Q € {v3}. 


Another variant A‘ (Kuroda’s negative translation), due to Kuroda (1951) 
(“q” from “quantifier”) is obtained as follows: insert 4, after each 
occurrence of V, and in front of the whole formula. 


3.8. PROPOSITION. MQC + 48 @ A*, IQC+ Ak © AY, 
Proor. By formula induction (exercise). O 


The remainder of this section is devoted to refinements and generaliza- 
tions. 


3.9. Subformula occurrences. For the methods and results below we need 
the notion of a “(sub)formula occurrence” (sfo, for short), rather than the 
notion of a (sub)formula. An sfo of a formula A may be specified by a 
formula together with its position in the construction tree (parsing tree) of 
A, e.g. by the code of a node. 
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EXAMPLE. A = Vx(P(x) > Q) > Q, P(x) and Q prime, has the follow- 
ing construction tree: 


P(x) ¢0,0,0) (0,0,1) 


P(x) > ae we 


Wx(P(x) > Q) (0) (1) Q 


Wx(P(x) > Q) > Q) 


There are two distinct occurrences of Q, one at (0,0,1), another at (1). An 
elegant method of marking occurrences is the method of contexts or 
formula-occurrence schemas, in which a placeholder symbol * marks the 
position of the sfo we are interested in. Syntactically * is treated as a prime 
formula. Thus, for example, the two sfo’s of Q in A correspond to the 
contexts B[*] = Vx(P(x) ~ *) > Q, and C[*] = Vx(P(x) - Q) > * 
respectively. Note that A = B[Q] = C[Q]. 
Formally we define the class @O.W of formula contexts as follows. 


DEFINITION. Let A, F be metavariables for formulas and formula context 
respectively. Then @O/% is inductively generated by the clause 


*, ANF, FAA,AVF,FVA, 
A> F, F>A,VxF,AxF Ee CON. 


In other words, @OW is the least class of expressions 2 such that for all 
formulas A and all FE 2 also *, AA F,FAA,...,AxF €%. O 


3.10. CONVENTION. For the remainder of this section we use the notation 
F[+*/C], A[B/C] permitting that variables in C become bound in F, A 
(i.e. we permit FV(C) M BV(F) #8). Where the condition FV(C)N 
BV(F) = @ is needed, it will be stated explicitly. This represents a deviation 
from our customs elsewhere in this book. 

For formula contexts F = F[*] we shall often write F[B] instead of 
Fl*/B). O 


Note that, for given A, each sfo in A corresponds to a uniquely 
determined F €.¢0Y such that A = F[B] fora suitable B. An sfo in A is 


Sect. 3] Relationships between classical and intuitionistic logic 61 


therefore specified by a pair (F, B) with F € @OYW such that F[* /B] = A 
(or equivalently one might have taken the pair (F, A) such that for some B 
F[B] = A). 


3.11. DEFINITION. We define simultaneously the positive contexts POS 
and the negative contexts VEG, Let P, N be arbitrary elements of POY, 
N€éG, and A an arbitrary formula. Then JOS, YEG are simultaneously 
inductively generated by the clauses 
(i) *, AAP, PAA, AVP, PVA, A>P, N>A, WxP, AXP E 
POF: 

(ii) AA N, NAA, AVN, NV A, A >N, PO A,VXN, AXN © NEG. 
Note that GOW ':= POLU NE and that POLSN NEY = B. 

It is now obvious what is meant by a positive (negative) occurrence of B 
in A. O 


EXAMPLE. For the A in 3.9 we see that 

C[*] = Vx(P(x) > Q) > * € POF, C[Q]=A; 

B[*] =Vx(P(x) > *) >QENEY, B[Q]=A. 
3.12. LEMMA. Let z be a list of variables free in B and C, and bound in F{B] 
and F[C], where F{[*] © GOW. Then 


(i) FE POS= VB > C)+,, F[B] > FIC]; 
(ii) FE VEG = VB > C)F,, F[C] > FIB). 


Proor. We prove (i) and (ii) simultaneously by induction on the construc- 
tion of F. 
Case 1. F = *: immediate. 
Case 2a. F = WVxD(x, *) © POF. The following deduction is correct 
VxD(x, B) Wx2(B > C) 
(D(x, B)] (VHB > C)] 


7) 
D(x, C) 
VxD(x, C) 


@ is given by the induction hypothesis. 
Case 2b. F = WxD(x, *) € V@¥Y: completely similar. 
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Case 3a. F = N[*] > A, N © NEG. In the deduction below the subde- 
duction @ exists by the induction hypothesis: 


[vz(B > ©] 
B 


N[C]> N[B] Q) MC] 
N[B] (2) N[B] > A = F[B] 
A 
(1) N[C]> 4 =F{C] 
(2) F[B] > F[C] 


Case 3b. F = P[*] > A, P © POF, 
Case 3c. F= A — P[*], PE POF, 


Case 3d. F= A > N[*], NE NEG, 
are treated similarly. We leave the remaining cases to the reader. O 


3.13. We shall now describe a generalization of the conservative extension 
result of 3.6, which can be applied to many theories formulated in (many- 
sorted) first-order predicate logic. 

We shall use formula schema, axiom schema for syntactic objects similar 
to formulas except for the possible presence of certain predicate symbols 
Pj, Pz, P3,.-- (regarded as being outside our standard languages for predi- 
cate logic), called place-holders. For example, the “induction axiom” is 
characterized by a schema 


p,(0) A Wx(my(x) > p,(Sx)) > ¥xp,(x), 


where p, is a unary (place-holder) predicate symbol. 

A formula is a formula schema with zero (i.e. no) place-holders. If I is a 
collection of formula schemas, A a formula, we write [+ A if A can be 
derived from instances of schemas in I’; F® is the collection of all A’, A an 
instance of a schema in I. 


3.14. DEFINITION. Let A[p,,..., ,] be an axiom schema. B,,..., B, are 
supposed to be free for substitution for p,,...,, respectively in A, ice. 
without loss of generality we may assume FV(B;) M BV(A) =. The 
schema is spreading if 


MQCt A[B,..., B28] > A[B,,..., B,]*; 
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the schema is wiping if 

MQC +r A[B,,..., B,]® > A[B8...., BE], 
and isolating if 

MQCt 4[B,,..., B,]® > =A A|BE,..., BE]. 


A set of schemas is M-closed under ® if [ +-,, A® for each instance A of a 
schemain lI. O 


N.B. The notions of spreading, wiping and isolating are relative to MQC. 
We may distinguish the corresponding notions for IQC as I-spreading, 
I-wiping and I-isolating. 

Since we may regard formulas simply as schemas without place-holders, 
the notions of spreading, wiping and isolating also apply to formulas. 

A formula of the form Vx(A — VyB), with A spreading, B isolating is 
called essentially isolating. O 


We note the following easy lemma. 


3.15. Lemma. If FV(A) BV(I'U (B}) =Q, then T+, B= I[ 4/4] 
t+, B[L/A] where [[ 1 /A] = (C[4/A]:C €T). 


PROOF. Obvious, since in MQC 1 is treated as an arbitrary proposition. 


3.16. THEOREM. Suppose I to be a set of schemas M-closed under ®, and let 

I'}+, A. Then 

(i) Jf A is isolating and 1 does not occur in I, A then +,, A. 

(ii) If 1 occurs in I only positively (i.e. forallC €T,C = F[*/1L] > F 
€ POF) and negatively in A, and A is isolating, then T -; A. 

(iii) As (i) for essentially isolating A. 

(iv) As (ii) for essentially isolating A. 

(v) JIfA is \-wiping then D+, A. 


PROOF. (i) Let A be isolating; assume '+-, A, then ['%+,, A®, therefore 
[t,, A® (LP is M-closed), hence [+ ,, =A (since A is isolating). Hence 
by the lemma 3.15 I[1/A]+,, (44A)[1/A], and since 1 does not 
occur in IU {A}, we have P+, (A > A) > A, ie. D+, A. 

(ii) Let BET, B* = B[1/A]. 1 occurs only positively in I’, hence by 
3.12 Blp,,..-, p,/A;,---, A,] > B*[p1,..., 0,/Ap---, A,]. Since A con- 
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tains 1 only negatively, we have by 3.12 |, A[1/A] — A. Thus from 
Tt, aA we now obtain [1 /A] +, (A[1/4] ~ 4) > A, so PF; A. 
(iii) Let A = VWx(B > VyC), B spreading, C isolating. Then if "+, A, 
also T, BL .C (y not free in I, B), and IU {B} is M-closed under & 
since B is spreading. This reduces (iii) to (i). 
(iv) is similarly reduced to (ii), (v) is trivial. O 


3.17. DEFINITION. We define simultaneously classes of axiom schemas 
SW, F (from “spreading”, “wiping” and “isolating”) as follows. Let P 
range over prime formulas distinct from 1 , p over expressions p,t,.-..t, 
(p, an n-ary place-holder symbol), S, and S, over Y, W, and W, over W, 
J, and J, over %. Then S,W, ¥ are inductively generated by the clauses 
(i) L, P, p, Sy A S,, S, V S5, VxS,, IxS), J, > S, € FS; 

(ii) L,p,W, AW, W, VW, VxW,, S; eo WL EW; 

(iii) P, W,, Jy: A Jb, J, V J, Ixd, ©. 

If we extend (iii) with an extra clause 


Sod Ee, 
we obtain wider classes denoted by ¥,, W;,, ¥,. O 


3.18. PROPOSITION. 

(i) If A[p,,..., P,] is a schema in S(W, F) then A is spreading (wiping, 
isolating). 

(ii) Similarly with SP, W,, ¥,) and \-spreading (I-wiping, 1-isolating). 


PRrooF. By induction on “, W, ¥, using the same identities as in the proof 
of 3.3. O 


By means of this proposition we can easily show for many schemas and 
formulas that they are spreading, wiping or isolating. 


3.19. DEFINITION. A d-formula is either a prime formula distinct from 1 , 
a disjunction or an existential formula. O 


The following proposition is a corollary to 3.18: 
3.20. THEOREM. 


(i) If there is no positive (negative) occurrence of a subformula of the form 
VxB in A, then A is isolating (spreading). 
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(ii) Jf A is the negation of a prenex formula, then 
CQCt A=>MOQCE A. 


(iii) Suppose that all positive (negative) occurrences of d-formulas in A occur 
in fact negated. Then A is wiping (spreading). 
(iv) If all positive occurrences of d-formulas in A occur negated, then 


CQCFA>MQCE A. 


PROOF. (i) by simultaneous induction, using 3.18. 

(ii) is an immediate consequence of (i). 

(iii) The assumption states in fact that whenever A = P[B](A = N[B)), 
B a d-formula, then P[+] = N[*/* — 1] for suitable N (N[*] = P[*/ 
* — J] for suitable P), where P © POY, NE WEY. Now use simulta- 
neous induction, with help of 3.18. 

(iv) is an immediate consequence of (iii). O 


We shall carry our refinements still further. 


3.21. Lemma. Let [jf (7) be the set of universal closures of all Wx AB > 
aAVxB such that WxB occurs positively (negatively) in A, then 


Py; A® >A, Ty +; aA OA. 
Proor. Simultaneously by induction on the complexity of A. For example, 
let A = C > D. Then Pf = [eG UT). By induction hypothesis 

TE, HC > C8, TZ, D§' > —D, 
and so 

ry EAC os. D)* ard (“AC ed AD) = aot ad Dy. 
If A = WxC, Tf =T¢ U H, where H is the universal closure of Vx C > 
—AVxC. By induction hypothesis [= +; C8 > —C, hence [7 +; (WxC)® 


> —AVxC, etcetera. O 


3.22. LemMA. Suppose A = F[=7C], F © GON, FV(C) 9 BV(WxA) = Q; 
then IQC + WxF[3C] > —VxF[C]. 
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ProoF. If FE VEY, VWz(C > =AC) holds ard we can apply lemma 
3.12(ii). If F Ee POS, we can find B, (by lemma 3.12(i)) and DB, (by 
intuitionistic logic) such that the following is a correct deduction 


(3) [VxF[-==C]] [-AC>C] (1) 
VxF[C] VxF[C] (2) 
Q, L 
SAAC SO). ~~ H{SSCO 6) @ 
ote 
WV xF[C] (2) 
VxF[4C] > VxF[C] 3) oO 


3.23. DEFINITION. The strictly positive contexts SPOS are inductively 
defined by 


*,AAS,SAA,AVS,SVA, A> S,VxXS,IxS € SPOFL, 


where S is a metavariable for elements of SPOS. 

A strictly positive occurrence of A in B refers to a pair (S, A) with 
SESPOFS, S[*/A] = B. 

A strictly positive subformula (or strictly positive part, s.p.p.) of B is a 
subformula which has a strictly positive occurrence in B. O 


Note that S#POY is properly contained in POS: Vx(* —- Q)E 
POS\SPOFS. 


3.24, DEFINITION. We define a special class of multiple formula contexts ¥ 

as follows. We have a countable set of symbols *,, *,, *,,...; F, F’ are 

metavariables ranging over #, A a metavariable for formulas. ¥ is 

inductively generated by the clauses 

(i) *,,VxF, AP FEF; 

(ii) suppose for all n and m, that n # m if *, © F and +, © F’; then 
FAF é€F. O 
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N.B. The condition in (ii) guarantees that each *, occursin Fe F at 
most once. Without loss of generality we can always assume an F © ¥ to 
be of the form F[*,,..., *,]- 

The intuitive significance of ¥ is easily explained. Suppose 
F[*,,....*,] EF and A= F{B,,...,B,], and let T be the tree of 
strictly positive occurrences in A; T is a subtree of the formula construc- 
tion tree of A. Then any branch in T meets one of the occurrences marked 
by *, without encountering a d-formula between 4 itself and the occur- 
rence marked by #,. 


3.25. Lemma. Let F[*,,...,*,] € 4%, BV(F) 0 FW(B,) = Bforl <i<n. 
Then 


IQC + VxF[B,,..., B,] ~ aaVxF[B,,..., B,]- 


Proor. One first shows, by a straightforward induction over #: IQC F 
maid | Biss wey B, | -> F [a8 sso B, |: 
Then we apply 3.22 n times. O 


The following proposition is now an easy corollary: 


3.26. THEOREM. 

(i) | Suppose that for each positive occurrence of a formula VxB in A there is 
an F €F with WxB = F[B,,..., B,], FV(B,) A BV(VWxF) = ®, then 
A is \-isolating, i.e. QC + A® — —A. 

(ii) (Mints, Orevkov) If, for any positive occurrence of a formula xB in A, 
there is no strictly positive occurrence of a d-formula in B, then A is 
[-isolating. 

(iii) (Cellucci). If, for any positive occurrence of a formula VxB in A, VxB 
is in fact of the form Wx(C(x) > D), x € FV(D), then A is 1-isolating. 

(iv) Suppose that T is M-closed under *. Then, if A =—A’ satisfies the 
conditions under (ii) or (iii), we have PL, AST, A. 


Proor. (i) Immediate, combining 3.25 with 3.21. 

(ii) Use (i) and note that the condition on A implies that each positive sfo 
of the form VxB is such that B= B[*,,...,+*,/1,..., 1] for some 
Bi eF. 
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(iii) Use (i) and note that the condition on A implies that each positive 
sfo in A of the form VxB is such that B = (C(x) > *, )[*, /D], and that 
C(x) > *, EF. 

(iv) A is I-wiping since A is I-isolating and ~~+.A’ @ 4A’ @ A hence 
3.16(v) applies. O 


4. Hilbert-type systems 


We shall now introduce codifications of intuitionistic predicate logic of a 
somewhat different type, to be denoted by the generic name of Hilbert-type 
systems. Deductions are performed from axioms by means of rules, but 
there is no cancelling of open assumptions. Hilbert-type systems are some- 
times easier to handle than natural deduction in proofs of metamathemati- 
cal results. The contents of this section are entirely technical, and the 
reading can be postponed until called for. 

The designations H-IQC etc. for the various systems introduced below 
are entirely ad hoc and are used only locally. 


4.1. The system H,-IQC. 
The first Hilbert-type system we shall consider is based on the following 
six groups of axiom schemata: 


m ANB->A, AAB-B, 
aaa A—>(B>(AAB)); 
ai A>AVB, BOAVB, 
ie (A>C)>((B2C)-(AVB>C)); 


— -Ax eter ee: 
(A> (B>C)) > ((A > B) > (A> C)); 


1 -Ax 1-7 A; 
A[x/t] > 4xA_ (¢ free for x in A), 
3-Ax ax(A > B) > (ayA[x/y] > B) 
(x €FV(B); y=xory ¢FV(A)); 


VxA > A[x/t] (t free for x in A), 
W-Ax Vx(B- A) > (B > VyA[x/y]) 
(xE€FV(B), y=xory € FV(A)). 
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In addition there are two rules: — E (modus ponens) and VI (rule of 
generalization): 


Vi Fb A =F VWXA. 


This corresponds to the rule VI of the natural deduction system in the case 
where no open assumptions are present. 

If we also permit assumptions (which of course cannot be cancelled) we 
use VI in the same way as in the natural deduction system. That is to say, 
we have 


VI THA=>ItVXA 


under the restriction that x € FV(I). This corresponds to Kleene’s notion 
of a “deduction from assumptions with variables held constant” (1952, 
section 22). If we regard the set I as a collection of axioms, possibly in 
open form, and permit the VI-rule without the restriction x €¢ FV(I’), we 
have Kleene’s “deduction from assumptions”. This corresponds to the 
treatment of non-closed axioms in 1.7. 

Below we shall only consider VI from hypotheses (assumptions) under 
the restriction x € FV(I). Therefore, if we consider deductions from 
axioms, we assume the axioms to be given in closed form. 

Let us use I’+,C to denote deducibility of C in H,-IQC from assump- 
tions I’, and let [+ C be used for deducibility in the natural deduction 
formulation N-IQC. Then we have a 


4.2. THEOREM. [+ Ciffl+,C. 


Proor. [+,C = IF C is easy: all the axioms of H,-IQC are derivable in 
N-IQC, and — E,VI are part of the rules of N-IQC. 

For the converse implications we must show how to transform a deduc- 
tion in N-IQC into a deduction in H,-IQC. We do this in two steps: we first 
show how to transform a deduction in N-IQC into a deduction in an 
intermediate system H with as only rules 31, > I, > E, but permitting 
axioms from A-Ax, V-Ax, 3-Ax, and axioms of the form VxA(x) > A(¢). 

The proof proceeds by induction on the height of a deduction tree in 
N-IQC. We have to show how we can successively remove all applications 
of AI, AE, VI, VE, JE, JI and VE. Suppose 2,, 2,, B; to be deductions 
in N-IQC of height less than k, and let Dj, D3, D; be the corresponding 
deductions in H. 
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To the left we show possible forms of a derivation @ of height k: to the 
right we indicate its transformation into a deduction H. 


a 
gd, gd, A>(B>(AAB)) A oy 
2 
A B B>(A AB) B 
ANB AAB 
g, DQ 
ANB AN B>A AAB 
A A 
2, 2 
A A->A VB A 
AVB AVB 
[A] 
D {B] 
Cf & 
(A>C)?((BRC)-=(AVB>C)) AC Cc 
[A] [B] (BC) ~(AVB>C) BoC 
2, 9, &, Sg | 
AVB C C AVB>C AVB 
G Cc 
gD, aN 
VxA (x) VxA(x) > A(t) VxA( x) 
A(t) A(t) 
J, D 
A(t) A(t) > 4xA(x) A(t) 
4xA(x) 4xA(x) 
[A(y)] 
%; 
Cc 
A(y) 7 € 
[A(y)] Wx(4>C) (4x4 2 C)  -Vx(A > C 
a, 9, x( ) > (ax ) x(A4 > C) 9 
AxA(x) C axA > C 3xA 


Cc Cc 
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The cases where 9 ends with an application of — E, > I, VI are of course 
trivial. 

Now we shall show, by induction on the height of a derivation in H, how 
to eliminate all applications of — I. 

First of all we observe that A — A can be established in H by means of 
the following standard deduction 9, using the schemas of — -Ax, where 
“AA” abbreviates “A > A”: 


[A > (AA > A)] > [(A > AA) > AA] A > (AA > A) 
(A > AA) > AA A-—AA 
AwA 


Consider any derivation 9 in H ending with — I, say @ is of the form 


[4] 
2, 
B 


A->B 


and let G be the result of eliminating — I from 2, (induction hypothe- 
sis). We have to show how to eliminate the final — I from 


We shall do this by showing how to transform each subdeduction Dy (with 
conclusion C say) of 2; into a deduction 
Ai 
A>C, 
by induction on the height of Zp. 


Basis-step: a top formula (occurrence) C not in the class indicated by [A] 
in QB; is transformed into 


CRE SC). 6 
2c 


and the top formulas in [A] are replaced by J,. 
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Induction step: B, can end with — E or VI. We again show to the left 
B,, to the nght ZF: 


DB, 9D; GD 
C-D C (Ax(CHD)) 7 (AC) > (AD) A>(CHD) Ox 
D (A>C)>~ (AD) A -C 
A-D 
DS 

2, A> Cx 

C(x) Vx(A > Cx) > (A > WxCx) Vx(A > Cx) 
VxC(x) A > WxCx 


Note that we have used our restriction on VI in 9}. Repeating this, we end 
up with 
GF 
A->B 
and we have eliminated the final — I in Y. By this method all — I- 


applications in a derivation in H can be removed, starting with the 
uppermost ones. O 


4.3. The system H,-IQC. The following system is almost the same as the 
one used in Kleene (1952): based on the axioms A-Ax, V-Ax, — -Ax, 
1 -Ax and 


A[x/t] - 4xA, WxA > A[x/t] (t free for x in A), 


and the rules — E and 


A>B B>A 
VI, > 2 a 
7A > WyB[x/y] ?AyB[x/y] > A 


where x € FV(A), and y = x or y € FV(B). 


THEOREM. Deducibility in H,-IQC is equivalent to deducibility in H,-IQC 
(and hence to deducibility in N-IQC). 


PROOF. Left as an: exercise to the reader. Some care is needed in transfor- 
ming applications of VI into applications of VI,. O 


4.4. The system H,-IQC. If we are interested in sentences only, we can also 
remove the rule of generalization. H,-IQC has as axioms all universally 
closed instances of A-Ax, V-Ax, — -Ax, V-Ax, -L -Ax, J-Ax; as its only 
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rule, we have a form of modus ponens 


Wx(A > B) XA 
VxB 


For sentences we can formulate the following 


— E, 


THEOREM. Deducibility in H,-IQC is equivalent to deducibility in H,-IQC. 
Proor. Exercise. 0 


4.5. Hilbert-type systems for IQCE and IQCE*. Let us first consider 
E*-logic. H,-IQCE* will be the system based on the axioms A-Ax, V-Ax, 
— -Ax, 1 -Ax and 


3-Ax— = A[x/t] A Et > 4xA_ (1 free for x in A); 
W-AxE  VxA A Et—> A[x/t] (t free for x in A), 
Ex, Et 4x(x=t), 
EAX 
Vx(x=x), t=sAt=srte=t, 
strictness for predicate symbols R and function symbols f 
RlGsics,t,) SERS 
Ef Ct sisasty) ER, 
and finally replacement for atoms 
RSA S=1 > Ri, 
E(fS) AF =f fe = ft. 
The only rules are — E,VI,,4E,. 


The corresponding version H,-IQCE of IQCE is obtained from 
H,-IQCE* by dropping Ex from EAX and replacing VI,, JI, by 
np ANEx>B 
>A > WyB[x/y]’ 
E BAEx->A 
> 3yB[x/y] > A’ 
where x € FV(A) and y = x or y € FV(B). 
N.B. For theories based on IQCE* plus axioms we may have to add SUB, 
the substitution rule. 


STR | 


REPL | 


VI 


4.6. THEOREM. Deducibility in H,-IQCE* and H,-IQCE is equivalent to 
deducibility in the natural deduction versions N-IQCE*, N-IQCE respec- 
tively. 
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Proor. Exercise. O 

4.7. H,-IQCE* and H,-IQCE. If we drop the axioms Ex in H,-IQCE* 
and replace 3-Ax5, V-Ax® by the special cases 

3-Ax,  A[x/y] > 4x4, 

V-Ax, Wxd > A[x/y], 


we obtain the system H,-IQCE*. 
If in H,-IQCE we replace the axioms 3-Ax, V-Ax5 by 


3-AxE A[x/y] A Ey > 3xA, 
W-AxE -Wxd A Ey > A[x/y], 
we obtain H,-IQCE. 


THEOREM. Deducibility in H,-IQCE*, H,-IQCE is equivalent to deducibility 
in H,-IQCE*, H,-IQCE respectively. 


PRooF. We give an outline and leave the details as an exercise. 

(i) Ex is derivable in H,-IQCE*. For x = x — x = x holds by proposi- 
tional logic, and thus with 4-Ax, x = x — dy(x = y). On the other hand 
Vx(x =x) > x =x from V-Ax,, and thus with the axiom Vx(x = x) and 
modus ponens 4y(x = y), and therefore by dy(x = y) also Ex. 

(ii) For any prime formula A and any term f occurring in A, one gets 
A — Et by induction over term complexity, repeatedly applying strictness. 

(iii) For any prime formula A by induction on the complexity of the 
terms occurring in A one shows with help of replacement and (ii) 


A[x/s] As=t7 Al[x/t]. (1) 


(iv) We establish (1) for arbitrary A by induction on the logical complex- 
ity of A. 

(v) We deduce 3-Ax5, V-Ax5 in H,-IQCE* as follows. By some logic in 
H,-IQCE* 


Et > 4y(t = y A WxA A Ey > A[x/y]), 
and therefore with (iv) 

Et > 3y(t=y A WxA A Et > A[x/t]), 
hence VxA A Et > A[x/t], etc. O 
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5. Kripke semantics 


5.1. In the first section of this chapter we have presented a set of rules for 
deriving intuitionistically correct logical laws; these rules were also valid for 
the classical truth-value interpretation of the logical operations. 

The notion “‘intuitionistically correct” or “intuitionistically valid” can be 
expressed more precisely as follows. Consider, for simplicity, a language # 
with relation symbols only, say R,,..., R,, where R; has n(i) arguments, 
and let A(R,...,R,) be a sentence in Y. An intuitionistic structure 
(model) for Y is a n+ 1-tuple #=(M,R,,...,R,,), where M is an 
intuitionistically meaningful domain, and R, is an n(i)-place relation over 
M, ie. R, CM", 

A(R,,..., R,,) iS (intuitionistically) valid in M (notation M* A) iff 
AM(R,,...,R,,) holds intuitionistically; here AM(R,,...,R,,) is obtained 
from A by replacing all occurrences of R; by R, and relativizing all 
quantifiers in A to M (i.e. Vx, 4x are replaced by Vx € M and 4x © M 
respectively). 

It is to be noted that formally 4“ A is defined in exactly the same way 
as for classical structures; the only difference is that we interpret 
AM(R,,...,R,,) intuitionistically. 

A is intuitionistically valid if “t A for all intuitionistic structures 
for #. We write — A for A is (intuitionistically) valid; if it is necessary to 
distinguish the intuitionistic reading of E from the classical one, we use a 
subscript: F;, F,. 

From the weak counterexamples it became clear that not all classical 
logical laws are correct in the sense of -,; , and thus we do not expect such 
laws to be provable in IQC. 

Showing directly, i.e. by means of combinatorial investigation of the 
possible deductions in IQC, that a certain schema is not derivable in IQC is 
usually far from easy. In classical logic underivability is mostly shown by 
giving a countermodel, but ; is technically not so easy to manage. 

Here we shall introduce a different semantical interpretation for showing 
underivability (which, moreover has many other applications to boot). For 
this semantics all rules of IQC are valid, and thus a schema invalid for this 
semantics cannot be derived in IQC. In 6.6 we will show that this semantics 
exactly fits IQC. 

This so-called Kripke semantics is not directly connected with the 
BHK- interpretation of the intuitionistic logical operators, but may be 
motivated via weak counterexamples. 

Recall the weak counterexample to PEM of 1.3.2, based on an as yet 
undecided proposition A. The argument depended on the following infor- 
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mation situation: 


1, A 


0 


the bottom node 0 represented our present knowledge concerning A: A has 
neither been established nor refuted; 1 represents a possible future informa- 
tion situation where A has been established. Thus in 0 we cannot assert A, 
nor can we assert —A, since a future situation 1 where A holds is not 
excluded. Hence at 0 we cannot assert A V —A. 

If the diagram above were a complete picture of the future information 
possibilities, seen from stage 0, we would be justified in asserting —— A at 0; 
for =A means that whenever we discover —A to be true, we must be able 
to obtain a proof of 1 (absurdity); but since in the diagram above we will 
never be forced to accept —A, we can for trivial reasons assert =A at 0. 

However, since A is an undecided proposition, the diagram is not a 
complete picture: there is also the possibility to discover that —A holds, ie. 
the situation is rather as in the picture below 


0 
Now we certainly cannot assert —— A at 0, since later we may arrive at node 
2 which together with _,A would force us to accept 1 ; but it is tacitly 


assumed that we shall never obtain inconsistent information. If this picture 
is regarded as exhaustive w.r.t. A, we may in fact simplify it to the 


following 
Ae 
0 


since at node 2 we will never be forced to accept A in a later stage. So, for 
trivial reasons, we may assert 4A at node 2 (“whenever in the future we 
discover A to be true, we can prove . ”). This picture also shows that we 
cannot assert ,A V —AA at 0. 

A more complex situation arises if we combine information regarding A 
and another undecided, independent proposition B; the diagram of possi- 
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bilities is 


A,B 


One may readily convince oneself that in this diagram we cannot accept 
(A > B) V (B — A) at node 0; for suppose we could prove A > B at 0. 
Now node 4 represents a possible extension of information where we have 
A and —B, contradicting the assumption that already at 0 we had A > B. 
Actually, (A — B) V (B > A) is unacceptable at 0 already in the following 
simpler information diagram: 


Aal %B 
Da 


We leave it to the reader to.convince himself of this fact. 
Now we are ready for a formal definition of Kripke semantics for IPC. 


§.2. DEFINITION. A (propositional) Kripke model is a triple #= 
(K, <, Ik), where (K, <) is an inhabited, partially ordered set (poset), and 
lk a binary relation on K X FP (F the set of proposition letters) such that 


kit Pandk’>k=>k't P. (1) 


We shall use the expressions “‘k forces P” or “P is true at k” for k iF P. 
IK is then extended to logically compound formulas by the following 
clauses 


Krl kKIKAAB=klI-A and ki B, 

Kr2 KILAVB:=ki-A or Ki B, 

Kr3 klLA>B:= forallk’>k, ifk’ Athen k’ tt B, 
Kr4 notk i 1 (i.e. k # 1 : no element of K forces 1). 
The elements of K are called nodes (of #). O 


REMARK. As a consequence of this definition 


kik 4A @ Whi > k(k’ KA). 
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Also 
k it} A @ Wk’ > kak" > k'(k” A), 
which is classically equivalent to 
Vk'> kak” >k(k” tt A). O 
5.3. Lemma. For all formulas of IPC we have monotonicity: 
Vk, k' CH(kI-FA and k’'>k=>k'tt A). 


Proor. By formula induction. Consider for example the case A = B> C, 
Assume klk BoC, k’ >k. kK” > k’, k’ I+ B, then also k” > k and 
k” IH B, hence by kik B> C also k” tL C; thus VA" S>k(k" I B= 
kK" IE C)ie kK’ I BOC. O 


5.4. DEFINITION. A formula A is valid at k in a Kripke model % iff 
kK lt A. A is valid in #= (K, <, \+) iff for all kK © K, k IF A; notation 
H+ A. If I is a set of formulas, we say that [i+ A (“A is a Kripke 
consequence of I"”) iff in each model % such that if for all BE I 4 B, 
then also #It A. A is Kripke valid (K-valid) iff 6 - A which will be. 
written as IF A. O 


REMARK. Suppose #= (K, <, IF) is a Kripke model, k € K. The srun- 
cated model %, is (K’, <’, IK’) where K’ = {k’:k’ > k}, <’is <[ K’ 
(the restriction of < to K’), and IF’ :=IFf (K’ xX #) (the restriction of 
IK to K’ XP), 

It is easy to see that k Ir A iff 4, IH’A iff k IF’A, since the definition of 
k i+ A depends on It for nodes k’ > k only. 

Note also that validity in % is equivalent to validity at the bottom node 
of %, if there is one. 


5.5. REMARKS. Observe that k IK A is not assumed to be decidable, not 
even for A prime. Even if k I+ P is decidable for P € # the implication 
clause may prevent k It A to be decidable for complex A. 

The definition of Kripke model may be read either constructively or 
classically. As a rule we shall treat Kripke models constructively, unless 
stated otherwise. In this connection we often use the terms “externally” and 
“internally”; external reasoning is reasoning on the meta-level about the 
models; something holds internally if it is (Kripke-) valid in the (or all) 
models under consideration; internal reasoning is “arguing by means of 
internally valid principles”. 
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If our external reasoning is intuitionistic, it might seem at first sight as if 
nothing is gained over the method of weak counterexamples: e.g. let # be 
a Kripke model with a single node, ky, where ky IH P iff A holds, and A is 
an as yet undecided proposition. Then we cannot assert ky + P V —P. 
However, a different Kripke model (see examples below) shows (classically 
and constructively) that * P V —P; by the soundness theorem proved 
below in 5.10 this implies IPC + P V 4P (and not just: we cannot show 


that IPC+ PV —P). 
If % consists of a single node and our external reasoning is classical, 
is equivalent to a classical valuation with truth values in {0,1}. 


5.6. EXAMPLES. ——P V —P is not K-valid; to see this, we specify a Kripke 
model by indicating the partially ordered structure as a diagram, writing 
next to each node the propositional variables forced at that node 


1 A 
0 
Clearly, 2 + —P; hence 0 # ——4P; and since 1 It P, also 0 K —P and 


therefore 0 « —,P V =P. 
Similarly, (P — Q) V (Q > P) is not K-valid, by the following Kripke 


model. 
1 \ 7 2 
0 


For a more complicated counterexample, consider 


0 


In this model 0 K (,P > P) V (AP V ——4P). For assume the contrary; 
then either 


QObH-—P or OH-AAP or Ol KAP-P. 
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0 I+ 4P is excluded since 3 Ik P; 0 lk —4P is excluded since 2 Ik —P; 
and 0 |-k —+P - P is excluded since 4 I+ —P, but 4 * P; hence 4 K 
——_P — P and thus also0 -K +P > P. 

As a result of the soundness theorem (5.10) this means that in IPC 
we cannot prove —_P V =P, (P > Q)V(Q->P) or (HP > P)vV 
(AP V =P). 


5.7. Kripke models for pure predicate logic. The notion of a Kripke model 
can be extended to IQC; we describe below the extension for a language 
with predicate symbols only. 


DEFINITION. A Kripke model for IQC is a quadruple #= (K, < ,D, IF), 

K inhabited, such that 

(i) (K, <) is a partially ordered set. 

(ii) D is a function (the domain function) assigning inhabiied sets to the 
elements of K such that all D(x) are inhabited and 


VkWk'(k < k' > D(k) ¢ D(k’)), 
ie. D is monotone. 
(iii) Let the language be extended with constant symbols for each element 


of D = U{D(k): k € K}; tt is a relation from K to the set of prime 
formulas in the extended language, such that 


kit R"(d,,...,d,) > d,;€ D(k) forl<i<n, 
kik R"(d,,...,4,) Ak sk’ = k' tH R"(d,,..., 4,). 

We now define k \r A for all sentences A in the extended language with 
constants in D(k). We will often loosely refer to the constants for elements 
of D(k) as parameters. For prime formulas, the definition is given by (iii) 
above; for the propositional operators we have the clauses Kr1—Kr4 as 
before; for the quantifiers we add, 


KrS k le Wx A(x) = Wk’ > kWd © D(k’)(k’ IF A(a)), 


Kr6 kt} dx A(x) = 34d Ee D(k)(kir A(d)). O 
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As to the intuitive motivation, we may think of D(x) as representing the 
elements of the domain which are known to exist at stage of information k; 
at later stages we may have discovered more elements belonging to the 
domain, hence D(k) C D(k’) for k’ > k. 


DEFINITION. For sentences in the language with constants for all d € D = 
Uf{D(k): k © K} we can define “A is valid at k” (=k forces A = A is true 
at k), A is valid in K, A is Kripke-valid, and T+ Aasin54,. O 


In dealing with Kripke models in the sequel, we shall tacitly assume that 
we are dealing with sentences (possibly containing parameters), unless 
explicitly stated otherwise. 


REMARK. We note also the fact that lemma 5.3 generalizes to IQC: 


kit A and k'’>k=>k'’ttA. O 


5.8. REMARK. There are other ways of presenting what is essentially the 
same notion of forcing in a Kripke model. 

(A) Instead of extending the language with constants for the elements of 
D = U{D(k): k © K} one assigns elements of D(k) to the variables of the 
language by a mapping 9, then defines “k | A under assignment ”; this 
method is well-known from classical model theory. 

(B) k lt R(x,,...,x,,) for a predicate letter R is given by a subset of 
D(k)", corresponding to 


{(d,,....d4,): 4d, € D(k) A... Ad, € D(k) Ak ib R(d,,..., d,)}. 


This variant suggests the definition of a Kripke model (K, < , IF) for 
pure predicate logic as a partially ordered collection of (classical) structures 
(K, < ,{.4,:k © K}), where ./, is a (classical) structure for pure predi- 
cate logic for each k € K. That is to say, if {R,;: 7 © I} is the collection of 
predicate symbols of the language, R, with n(i) arguments, then “4, = 
(D(k), {R,,: 7 € I), Ry, © D(A)", and k IF Ri(dy,..., d,) = 
R, ,(4,,...,4,,). The model must satisfy the monotonicity conditions: if 
k <k’, then 


D(k) Cc D(k’) and R,,C R; x: 


Warning: the monotonicity condition is strictly weaker than “.@, is a 
submodel of .4,.”. 
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5.9, EXAMPLES. Next to each node k we indicate D(x) and the atomic 
sentences true at the node. 


First example. 
1 DQ) = {a, b} R, Ra) 


0* DO) ={a} Ra) 
In this model 


0 He Vx(R V R{(x)) > RV VWXR (x). 


For we have 0 Ik Vx(R V R(x)), since 0 Ik R'(a), and hence 0 Ik R V 
R’(a); and alsol IF R, sollk RV Ra), 1! RV Rb). But we do not 
have 0 Ik R V Vx R(x), since 0 K R, and 0 KK Vx R(x), since 1 K R’(b). 


Second example. 
k, {0, 1,3, 3,4} RO, R1, R2, R3 
k, {0,1,2,3} RO, R1, R2 
k, {0, a RO, R1 
k, (6 1} RO 


| 
ky {0}. 


This model is a counterexample to —Vx(R(x) V +R(x)). Assume Kg IF 
~AVx( R(x) V AR(x)); then Vk(k K AVx( R(x) V AR(x))), so 


Vk a-ak! > k(k! Ik Vx(R(x) V aR(x))). (1) 


Assume k, Ik Vx(R(x) V AR(x)), then in particular k, IF R(p) Vv 
—R(p); R(p) does not hold at k,, but also k, i 4R(p), since k,, IF 
R(p). This shows “4k’ > k(k’ Ik Wx(R(x) V =R(x))), therefore (1) is 
false, hence ky HK —AVx(R(x) V =R(x)). Observe that even kg IF 
AVx(R(x) V AR(x)). 

Again by soundness (5.10), it follows that the schemas Vx(.A V B(x)) > 
A V Wx B(x) and —AVx(A(x) V —A(x)) are not derivable in IQC. 


5.10. THEOREM (soundness for pure —IQC). T/} ASIA. 


Proof. The proof is by induction on derivations in the system of natural 
deduction for 1QC. 
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Let a derivation 9 terminate with a rule application 
TI, FA, I, A,... DFA, 
Tra 


where A+ B expresses that B has been derived from assumptions A. Let % 
be any Kripke model, k € K. 

The induction hypothesis is as follows. For any k © K let I’, A’ be 
obtained from I, A; by some substitution of elements of D(k) for the free 
variables in I, A;. Assume that for all & € K and for all substitutions, if 
ki- Tj for 1 sis<n, then kt} Aj; k i+ TY abbreviates k I B’ for all 
Bel, 

We shall consider two cases. 


Case 1. Suppose the last rule applied is VE: 
I,+} AVB ,AtrC T,BEC 
MUrmUinrRC 
where A €I,, BEI. We have to show for any substitution, and any 
k © K thatifk li I] Ul; UT, then k  C’. Suppose kK IK TY UTS UT; 


by the induction hypothesis k Ik A’ V B’, kKIF A => KI-C’, KI Bo => 
k |b C’. Since k Ik A’ V B' > k IE A’ or k I+ B’, we have k Ik C’. 


Case 2. Suppose the last rule applied is VI: 


Tt A(y) 


eral SE t free in I). 
Teves) ten) 


For simplicity assume I and Vx A(x) to be closed. We have to show for 
each KEK: if kiKT then & + VxAx. Assume k IL IT; by induction 
hypothesis, for all k’ => k and all d © D(k’) we have k’ It A(d). Hence 
k lk VxA(x). 

The other cases we leave as an exercise to the reader. O 


5.11. Kripke models for the full language of 1QC. Let us now consider a 
language with equality, predicate symbols {R,;:i € 1}, function symbols 
and symbols for constants { f;: j © J}, where R; has n(i) arguments and 
f; has m(j) arguments. A classical structure @ for this language is 

(D, ~ ,{R,:i€7}, {f;: 7€J}), 


where D is an inhabited set, ~ an equivalence relation on D interpreting 
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equality, R, © D”” such that 

d~d’ and deER,>d’ ER, 
f,€ D” > D such that 

d~dotd~ td’. 


This slightly deviates from the usual definition of the classical model where 
~ is replaced by = ; for technical reasons we prefer retaining ~ . 


DEFINITION. A Kripke model for the above language is a partially ordered 
collection of such classical structures: 


#=(K, <,{(D(k), ~..(Ractt eT}, (fn: F EF}):k © K}) 


satisfying monotonicity conditions: if k < k’ then 
D(k) C D(k’); 
d~,d'>d~,d’; 
Rig C Rigs 
graph(f, ,) ¢ graph(f, ,). O 


We can also describe Kripke models for the full language as quadruples 
(K, < ,D, I+) where the forcing relation I+ for prime formulas has to 
satisfy a number of conditions at each node. First of all 


d~,@=kt-+-d=d' 


has to be an equivalence relation on D(k). Secondly, writing k Ir d=d' 
for Vi < n(k | d; = d;) where n = \th d = Ith d’, we must require 


kitd=d'=>kt fd=fd', 
kit-d=d' and kit Rd>kt- R,d’, 
plus the obvious monotonicity conditions. 

In the sequel, if we present a Kripke model of a language with equality 
and function symbols, we shall tacitly assume these conditions to be met. 
Note that the conditions just listed permit us to construct the partially 
ordered collection of classical structures of the definition above. The 


soundness theorem extends to the full language, as the reader can easily 
check: 


THEOREM (soundness for IQC). [/} ASTI A. O 
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5.12. Kripke models with restrictions. In dealing with languages with 
equality, and especially when function symbols are present, it is sometimes 
advantageous to use a different (but essentially equivalent) definition of 
Kripke model: a Kripke model is now defined as a quintuple (K, < ,D, 
l- ,~), where (K, <) is as before; D assigns inhabited sets to elements of 
K, and ¢ is a collection {9,,.: k < k’}, py, © D(k) > D(k’), of transi- 
tion functions (restrictions) such that 


P,, is the identity on D(k), 
Pr? Pew = Pray fork sk’ <k", 


and q,, preserves forcing, i.e. for prime formulas R(d,,...,d,) with 
d,,.-., 4, © D(k) , 


kit R(dy,...,d,) > KF R( @yy(d)),-- 5 Pane (dn)): 


An n-ary function f is interpreted in # by a family f = {f,: k € K}, the 
f, € D(k)” > Dck) commuting with the restrictions for k’ > k 
Peat (dis --+s An) = fel Peds ++ Predn)- 


Any Kripke model for a language with equality can be transformed into a 
Kripke model with transition functions where equality is interpreted at each 
node as equality: given (K, < ,D, I+), let 


d‘=d/~, ford€D(k), 
and put Dk) = {d*:d€ D(k)}, 9,,(d*) = d*. We leave it to the 
reader to show 


k it A(dy,...,d,)  k ih’A(dk,..., d*). 


5.13. Applications to the theory of equality. We give two examples of the use 
of Kripke models in the theory of equality. In the examples a, b, c and d 
are assumed to be distinct. The following nearly trivial model shows that 
HVxy(x=yVxF#y): 


1 a~,b 
0 ¢ D(O) = D(1) = {a, b} 
More interesting is the following proposition, showing that the axiom of 


choice (equivalently, the assumption of a Skolem function) is not conserva- 
tive in the presence of equality. 
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PROPOSITION. Let Y be the language of predicate logic with equality and a 
single unary function symbol f; let LY’ be LY without {. Let T be pure 
predicate logic with equality axioms Vx(x =x), Vxyz(x=yAx=z-> 
z=y). Then T, = T + {Wx(ax = fx), Vxy(x = y > fx = fy)} is not con- 
servative over T + {Vx4y(nx = y)} = T, relative to Z’. 


PROOF. We exhibit a Kripke model % for T, in which a theorem of T, is 
not valid: F == Wx,4yWxaAy,[x, # y, A x2 # yy A(X = X2 > VY, = Y2)] is 
not forced at node 0; % is given by the picture below. 


1 2 
a~,c¢ a~,b 
b~,d 
W) 
D(O) = DQ) = D2) = (a, b,c, d} 
~, is = on D(0) 


We leave the verification as an exercise. O 


5.14. Kripke models for E *-logic. The definition in 5.11 above can also be 
used for E*-logic. In the classical structures the f; may now be partial. The 
strictness axioms determine the interpretation of terms built from elements 
of D(k) with the help of the function symbols of the language via 
Et = 3x(t = x), (t=s5) = 4y(t=yAs=y) and (fi=x) = Iy(tT=y 
A f;¥ = x); the validity of basic relations applied to compound terms is 
also determined by strictness: R,f:= 3(t = ¥ A R,j). 

The soundness theorem extends to E*-logic without problems (exercise). 

For E-logic the formulation of an appropriate version of Kripke semantics 
is not quite so straightforward, because of the role of the free variables and 
the presence of the substitution rule. 

A sound interpretation is obtained by taking a Kripke model to be a 
partially ordered collection of classical structures with partial functions 


X= (K, <,{(D(k), ~, {Ricci el} (hc: FET}) kK), 


where now the D(k) may be possibly empty, but where D = U{D(k): 
k € K} is inhabited, and putting k Ik Ed iff d © D(k). Strictness then 
enforces that for atomic formulas P with FV(P) = {x,,...,x,}, KIF 
P[x,,...,X,/d),...,4,] implies that d,,...,d, © D(k). For formulas A 
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with FV(A) C {x,,..., x, } we require 
kit A(xy,...,%,) = Wd),...,d, € D(k F A(d;,...,d,)). 


As shown by Unterhalt (1986) E-logic is sound for this semantics, but 
strong completeness fails for open theories (such as the theory of global 
existence { Ex}), though strong completeness holds with respect to closed 
theories. 

Strong completeness is recovered if we let the interpretation E, of E at k 
be a possibly empty subset of D(x); strictness then must hold w.r.t. the 
interpretation of E only. But we shall not pursue the topic of Kripke 
models for E-logic here. 


6. Completeness for Kripke semantics 


In this section we present a classical completeness proof (of the 
“Henkin-type’”) for IQC and IQCE*. We shall appeal to completeness in 
the next section (7.5), but the result proved there can also be obtained by 
purely syntactical methods (7.13). Semantics for intuitionistic logic will be 
treated at length in chapter 13. 

N.B. The completeness proofs given rely on classical metamathematics. 


6.1. DEFINITION. Let I, A be sets of sentences in a given language #. The 
pair (I, 4) is consistent iff there are no finite I, C I, 4) C A such that 
+ AI, > VAg; here we take AM = T , VO = 1 . I is consistent iff (T,@) is 
consistent. O 


6.2. DEFINITION. Let C be a set of constants; a set of sentences I" in the 
language L.is C-saturated iff 

(i) T is consistent, 

Gi) FrRFASAET, 

dij) TRFAVB>TtrAorlees, 

(iv) Ft 4x A(x) = forsome cE C A(ce)ET. A 


REMARK. The appeal to classical logic on the metalevel occurs in the proof 
of the saturation lemma 6.3 below, as well as in the proof of the principal 
lemma 6.5 where we argue by contradiction. 

Lemma 6.3 below describes how to construct #(C)-saturated extensions 
for any consistent set of “sentences in I’, C a countably infinite set of new 
constants. A variant of the saturation construction is given in E2.6.3. 
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6.3. LEMMA (saturation lemma). Suppose T + A, I, A in a language L; let 
C = {€ 9, C1 €2,--- } be a countable set of constants not in £, and let £(C) 
be L extended with C. Then there is a C-saturated T*’ 2 T, such that 
T° vw A. 


Proor. We obtain I’ as U{I'*¥:k EN}; F°=T. I*\ T° is finite. We 
give an inductive definition of I. 

Let g(n) be the first i such that c; does not occur in I"\T, and let 
(B,, V B,); and (3xA;(x)), enumerate with infinite repetition all disjunc- 
tive and existential sentences of “(C). Suppose I* to have been defined. 


Case 1. k = 2n, [* + 3x.A,x. We put 
peti pry (Axes) } 3 
Case 2. k = 2n+1,I*+ B,, V B,,>. Put 
Pent? = Pmtl yy (BY, 
where i is the least of {1,2} such that [?"*1 U {B, ,} # A. 
Case 3. Cases 1 and 2 do not apply: put 
Dy41= Ty. 
We leave it to the reader to verify that I’ is saturated and that ® # A. O 


REMARK. Strictly speaking, the enumeration of the existential formulas has 
the form (4x,A;(x;)),, since the bound variable also depends on i; this 
dependence however is irrelevant since we agreed to regard formulas 
differing only in their bound variables as isomorphic. 

An alternative version of the construction uses enumerations of disjunc- 
tions and existential statements (without requiring infinite repetitions), and 
treats at stage n = 2k the first formula 1xA,,(x) not yet treated, and at 
stage n = 2k + 1 the first disjunction not yet treated. At n = 2k, when 
It 4xA,,(x), one adds A,,(c;) for the first c;<@C not occurring in 
I" U {4xA,(x)}. O 


6.4. DEFINITION (canonical model construction). Let Cg, C,, Cz, C3,... bea 
countable sequence of disjoint countable sets of constants not occurring in 
£; write C* for C, U C, U --- UC,,. Let I be any theory (set of sentences) 
in Y. Then we define 


X= (K, c, IF ,D) 
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such that 

(a) K consists of all I > Iq such that (1) = YU Cx, T saturated w.r.t. 
C* for some n; 

(b) if I is C¥-saturated and Y(I) = ¥ UC*, then D(T) = C?; 

(c) for prime P in P(D(T)): FH P=PeEDr. O 


6.5. LEMMA. For all IT © K and each sentence of £(D(I)) 
TRAeAET, 


Proor. By induction on the complexity of A. 
Case 1, For A prime the lemma holds by definition. 
Case 2. For A = B A C immediate. 


Case 3. Let A=BVC, thn THBVCe(rk B or Fi Ce 
(BeTorCEelr)eBVCET (by the saturation of I). 


Case 4. Let A = 4x B(x). If Ti+ dx B(x), then I i+ B(c) for some c € 
D(L); by the induction hypothesis B(c) € I for some c € D(I), and by 
the saturation properties of I’ this is equivalent to 4x B(x) € I. 


Case 5. Let A = B > C, and suppose I + B — C. Then for all saturated 
I’ > TI wehave I’ +t B= I” C. Assume D4 B> C, then IU {B} + 
C; let I’ be a saturated extension of I U { B} such that I’ # C (lemma 
6.3), then I” I+ B but not I’ it C (induction hypothesis); this contradicts 
T t+ B > C, hence [+ B > C. The converse is trivial. 


Case 6. Let A =VxB(x), and suppose [lr Vx B(x), TY Vx B(x), 
I C¥-saturated. Now, for any cE C,,,, [4 B(c); by lemma 6.3 we can 
find a C*,,-saturated I’ > I’ such that I’ # B(c), hence I’ i B(c) con- 
tradicting [+ Vx B(x). The converse is again trivial. O 


6.6. THEOREM (strong completeness for 1QC). 
THAS>TFA. 


PRooF. Suppose I # A, and let I be a saturated extension of I, A € Ih; 
construct a Henkin model as above, then I, K A. This yields completeness, 
also for languages with equality, if equality is interpreted at each node by 
an equivalence relation (cf. 5.11). For models with transition functions as in 
5.12 we have to take equivalence classes; the details are left to the reader. 

Oo 
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6.7. THEOREM. Strong completeness holds also with respect to 1QCE*. 
Proor. Exercise. O 


The remainder of this section is devoted to certain refinements of the 
completeness theorem and is not needed in the sequel. 


6.8. THEOREM (transformation of a rooted Kripke model into a tree model). 
Let # = (K, <, |) be any propositional Kripke model with root (origin) 
ko, ie. Wk © K(k) < k). Then there is mapping » from a Kripke model 
KH! = (K’, <’, IF’) to X, where (K’, <') a tree with root kj, such that 

(i) p(k) = kos 

(ii) for all A and k’ © K': p(k’) HA iff k' tA; 

(iti) ky <’ k5 > pki < pk} (@ is weakly order preserving). 


PRoorF. K’ consists of all finite sequences (ko,...,k,) with ky < k, < 
- <k,; 0 <' 7 iff o is an initial segment of 7. We define forcing for 
propositional variables by 


(kos +++a kp) H'P = k, Ie P. 


By formula induction one then shows 
{Ko,-.-» ky) I-’A @ kt A. 


We illustrate the construction by an example. Consider e.g. the partially 
ordered set (K, <), as on the left in the picture below; by taking finite, 
< -increasing sequences of nodes, ordered by the “initial-segment-of” 
relation, we obtain a tree (K’, <’) as shown to the right. 


3 0, 1, 2 (0, 2, 3) 


1 2 ¢0,1) (0,2) (0, 3) 
0) 


We leave the details of the proof as an exercise. O 
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6.9. COROLLARY. For any [finite] model #= (K, < , IK) we can construct 
some [finite] collection { 4, : i € I} of tree models such that for all sentences 
in the appropriate language X\- A iff for alli © 14, \+ A. 


ProoFr. Apply the preceding theorem to the truncations of #. O 


6.10. REMARK. (i) The construction permits some variations, for example 
(a) We can take finite sequences (ko,...,k,) such that ky <k, <--> 
<k,. 
aan 4 
(b) For finite (K, <) we can restrict attention to (kg,...,k,) such that 
each k;,, is an immediate successor of k,. 
(ii) The construction can also be adapted straightforwardly to predicate 
logic. 
We shail now show that for IPC Kripke models on finite trees suffice for 
completeness. 


6.11. THEOREM. Let #=(K, <, I+) be a Kripke model for IPC 
with root ky © K, and suppose ky A. Then there is a finite model #* = 
(K*, <*, Ir *) such that K* C K, <* is the restriction of < to K, and for 
all k © K* and all subformulas B of A 


ki-*B eo kit B. (1) 
Proor. The proof proceeds by the so-called “filtration method”. We 
successively choose nodes from K which are needed for making A invalid; 


the process terminates because A has only finitely many subformulas. Let 
S, be the collection of subformulas of A, and put 


S(k) = {BES,: kit B}. 
Initially we put 
Ky = {ko}, Kup; 


Suppose K, to have been defined, and let k @ K,\K,_,. Let 
-> ty, nce) be a maximal subset of K such that for i, j < n(k): 


ae 
ksth 5 
S(k) # S(t, ;); 
kk’ <t, > S(k) # S(k’) Vv S(k’) # S(t, ;); 


i#j > S(t.) # S(t,,;)- 
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We then put 
Ky41 = K, U foc (k € K,\K,-1) A i < n(k)}. 


Clearly, K,,,, = K,, for sufficiently large n, since S, is finite and has only 
finitely many subsets. Therefore we can take 


*:= U{K,:n EN}, 
kt+*P iffk\i-+ P for P atomic. 


We prove (1) for arbitrary subformulas B by induction on the complexity 
of B. Consider e.g. the case B = B, > By. k -* B, > B, iff Vk’ =>*k 
(k’ Ik* By => k’ I+ *B,) iff Wk’ 2*k(k’ it By = k’ i By), hence klk 
B, > B, implies k -* B, > B,. Conversely, assume k I+ * B, > B, but 
k l* B, > B,; then for some k’ > k k’ Ik B, and k’ i* B,. Suppose k € 
K,\ K,_. Then there must be a ¢,; © K;,,\K; (why?) with 7, ; #* B, 
— B, and so k i** B, > B,; contradiction. We leave the other cases to the 
reader. O 


We obtain now as an immediate consequence: 


6.12. THEOREM. 
(i) If IPC # A, then A is not valid on a Kripke model over some finite tree. 
(ii) IPC is decidable. 


PRroorF. (i) follows immediately from the preceding theorem together with 
6.6 and 6.8. 

(ii) By (i) we can enumerate all possible countermodels, that is to say we 
can in fact enumerate all Kripke-invalid formulas. We can also enumerate 
all valid (= derivable) formulas. With classical logic, every formula is either 
Kripke-valid or has a Kripke countermodel. Thus we can simultaneously 
enumerate valid and invalid formulas; each formula will occur as either 
valid or invalid. We conclude that IPC is decidable. O 


REMARK. It is possible to remove the classical steps in the argument, for 
example via a formalization of the completeness theorem (see Smorynski 
1982). For a direct syntactical proof of (11) see section 10.4. 
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7. Definitional extensions 


In this section we shall discuss several types of definitional extensions of 
first order theories: 
(a) the addition of definable predicates, 
(b) the addition of new variables for a definable domain, 
(c) the addition of definable partial or total functions and the addition of 
description operators. 


7.1. DEFINITION. Let T,T’ be two theories, T formulated in a language £, 
T’ ma language #’, 2C L’. T’ is a definitional extension of T, if there is a 
mapping @ from the formulas of #’ to the formulas of ¥ such that 

Gi) (A) =A for all formulas A of 2%; 

(ii) T’ + g(A) @ A for formulas A of Y’; 

(iii) T’+ A > TE (A); 

(iv) » commutes with all logical operations of Y. 

In particular, T’ is a conservative extension of T, i.e. 


T’ +}A=>T#A _ forall formulas A of Y. O 


7.2. THEOREM. Let T be a theory in a first-order language £, and let T’ be 
an extension of T to a language £’ =LU{M}, where M is an n-ary 
predicate constant, with an axiom 


Al Siastiyh) @ Mies t) 
for a formula A with FV(A) C {x),...,x,}, A a formula of #. Then T’ is a 
definitional extension of T. 


PRooF. Exercise. O 


7.3. THEOREM. Let T be as before, and let T’ have a new sort of variables 
(say a, b,c) with quantifiers Va,Aa; let A be a definable unary predicate T 
such that T + 4xA(x) and let T’ contain the additional axioms 


Vaax(a=x A A(x)), 
Vx(A(x) > Ja(x = a)). 


Then T’ is a definitional extension of T. 


Proor. Exercise. O 
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7.4. Addition of function symbols and description operator. This is more 
complicated. We consider theories based on E-logic or E*-logic with 
equality, and strictness for functions and predicates. We use f for function 
symbols and write 7 for a sequence of terms f,,...,,,, Pi? for P(t,....1,), 
¥=ffors,;=t,A--: As, =1t,, ft for f(t,,..., ¢,). Convenient axiomati- 
zations for our purpose are the systems H ,-IQCE* and H,-IQCE described 
in 4.7. In this section we shall use L* for IQCE*, where free variables are 
assumed to exist, and L for IQCE. 


7.5. THEOREM. The addition of a description operator to L or L* is a 
definitional extension. 


Proor. We give a semantical proof for the case of L*. Consider any Kripke 
model, say #= (K, <, I+ ,D). We can interpret the descriptor Ix. A(x, y) 
such that the axiom DESCR holds, as follows. 

A may itself contain descriptions Ix. B(x), Ix.C(x) etc. We assume these 
to have been interpreted already such that DESCR holds, (induction 
hypothesis). We simply put 


k i+ d= 1x.A(x, d,...,d,) 


ford,,...,d,,d © D(k), iff k Ik A(d, d,,...,d,) A Vx(A(x, dj,...,d,) & 
x =d), and the validity of DESCR is immediate. This shows that the 
addition of DESCR is conservative, as one can easily see. 

It is now not hard to construct a mapping p showing that the addition of 
DESCR is in fact a definitional extension. 

First we reduce arbitrary prime formulas f= s, P(t,,...,¢,) to prime 
formulas of the forms x = t, P(y,,..., y,) by repeated use of the equi- 
valences 


t=seody(y=taAye=s), 
Pi 50205 tj) PIs YI HHA 0 AHA P Vises Hs 


(y not free in ¢,5; y, not free in t,,...,1,). We now eliminate the 
descriptor from y = t by repeated use of 


y =1x.A(x) eo Vx[Axox=yl], 
y=f(7) @ aylz=f(W) A= 7). 


By strictness and DESCR, the result is a mapping y such that | y(A) @ A 
for all A. 
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For L a similar argument (Unterhalt 1986) is possible but we shall not 
carry this out. O 


7.6. COROLLARY. Let T be any closed theory based on L or L*, and suppose 
T to be axiomatized by closed axioms and axiom schemas (not necessarily 
closed ); if the axiom schemas translate under into instances or consequences 
of the axioms and axiom schemas, the addition of a descriptor to T is a 
definitional extension. O 


A typical example of such a theory T is HA. 

The preceding proofs of 7.5 and 7.6 are classical, inasmuch we relied on 
the classically established completeness proof for intuitionistic predicate 
logic. Below, we shall present a straightforward syntactical proof. 

7.7, NOTATION. We shall use the following abbreviation 
AIXB = 3xXW7Z(B[x/zZ] e7F=<x). O 

Warning. Do not confuse 4!x,x, and 4!x,4!x,! 

7.8. LEMMA. 

(i) IIXBoAXB A Vxz(B A B[xX/z] ~ 7= x), 


(ii) Let x’ be some permutation of x. Then I!x B <> Ax’ B. 
(iii) Leta FV(B) = 9, x 0 FV(C) =Q, then 


Alxu(B A C) @ XB A AWC. 
(iv) Let ¥ A (FV(C) U FV(D)) = Q; then 
WX B— (Ax(B A C) A 3x(B A D) @ 3x(BACAD)). 


ProoF. Exercise. O 


7.9, DEFINITION. Let A(x, y) be a formula of L (L*) with x, y € FV(A). 
Then L(¢~) (L*(q@)) is obtained from L (L*) by adding a function symbol » 
with axiom 

AX(A,g) VWxy(Wz(A(%,z) oy =z) o p¥=y) 
and extending the axioms and rules of L (L*) to the new language. O 


7.10. DEFINITION. We define a mapping * from L{q) into L, and from 
L*(q@) into L*, as follows. 
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For notational simplicity, let p be unary, i.e. X = x. All formulas of L(g) 
are supposed to be given with g-indexing, that is to say in any formula B 
we assume the occurrences of @ to be indexed in such a way that indices 
occurring in the same prime formula occurrence are all distinct. 

The definition of * is such that B* does not depend, except for the 
renaming of bound variables, on the choice of indexing. To indicate an 
occurrence of » with index i we write 9,. 

Below the variables yp, y,,... are always supposed to be fresh; y, y’,... 
are sequences of fresh variables. In writing 4yB, Jy’ B* etc. the y, py” are 
tacitly assumed to contain all y,’s free in B. 

We first define two auxiliary functions __,6 on terms, and terms plus 
formulas respectively. The underlining operator __ is specified by 


t= for g-free t; 


8 


if = Yes 
f(t... t,) = Tes es es | for function symbols f of L. 


The degree & is given by 
5(t) = 0 for -free r; 
8( pit) = 8(t) +1; 
5(f(t,,.--,t,)) = max(S(t,),...,8(4,)) (8(# ) for short); 
5(P(t,.--,t,)) = 8( ), 6(1) = 0; 
5( BoC) = max(6(B),8(C)) fore €{A,V, >}; 
6(VxB) = 5(4xB) := 6(B). 


In words: 5(B) is the maximum nesting depth for occurrences of in 
terms occurring in A. We now define simultaneously e and *: 


e(t):=T if ris p-free (e.g. T = Vx(x = x)); 

e(g,t) = A(t, y,)* A Et*; 

e(f(t,,---.%,)) = e(t,) A +++ Ae(t,) (e(7 ) for short); 
P(ty,...,0,)* = Alye(7) A ap(e(i ) A Pi), and 

* commutes with A, V, > ,V,3. 


By induction on 6(B) one readily shows that e and * are well-defined (note 
that A is -free, so 8( A(t, y,)) = 6(Et) = 6(9;t) — 1). O 
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7.11. LEMMA. For all B in the language of L: 

(i) B* is o-free; 

(ii) Lh B* © B for o-free B; 

(ili) L(p) F (Eg;t)* © Aly, A(t, y,)* A (Et)*s 

(iv) Lp) F (pit = x)" Vy (AG, y)* @ x =) A CED), 


and similarly with L* instead of L. 
Proof, Exercise. 0 


7.12. LEMMA. L*(@) and L(¢) are definitional extensions of L*,L respec- 
tively, via the mapping *. 


PROOF. We have to show 

(i) Lip) Bo B*, 

(ii) Lt(g)- B=> Ltt B*; 
(iii) L(g) + B= Lt B*. 


To keep the notation simple, we shall assume P, f below to be unary; y, y’ 
are the y-variables of ¢ and s respectively. We use = , = for derivable 
implications and equivalences. 

(i) can be proved by induction over 6(B). If 6(B) = 0, B is o-free, and 
we can use 7.11(ii). Now let 6(B) > 0; we first show for all ¢ with 
5(t) < 8(B) 


L(p) Fh t= xo (t= x)*. (1) 
Case (a). t a variable; trivial. 


Case (b). t = fs. Then e(t) = e(s), t= fs, and fs = x @ Az(s =z A fz = x) 
(STR, EAX) © 3z((s = z)* A fz = x) (application of (1) with ¢ = s, by the 
induction hypothesis) = 4z(3!ye(s) A Jy(e(s) As= 2) A fe =x) eo 
Alye(s) A Ay(e(s) A fs= x) = (fs = x)*. 
Case (c). t= 9,8. Then 8(s) = 8(Es) = 8(A(s, y,)), and 8(s) < 6(t) < 
8(B). Thus gjs = x © Wy,(A(s, y,) @ x =y,) A Es (by AX(4,9)) © 
Vy(A(s, y,)* @ x = y,) A Es* A Ex (induction hypothesis) = (@,s = x)* 
(by 7.11(iv)). 

Now we can prove B @ B*,. For B prime, assume B = Pt, 
x €FV(t). Then Pt I4x(Px A x =t) (by STR, EAX) @ 4x(Px A 
(x = t)*) (by (1)) = 3x(Px A Bye(t) A AV(e(t) Ax =D) ee 
Aly(e(t) A Aly(e(t) A Pt) = (Pt)*. For B not prime, we observe that * 
commutes with all logical operators. 
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(ii) Since * commutes with all logical operators, all axioms and rules of 
IQC are translated into instances of themselves; we only have to look at 
EAX, STR, REPL, AX(A, ¢). 


Case (d). EAX: (Et)* = (alye(t) A Ay(e(t) A Ed) © (Alye(t) A ay(e(t) 
AAx(x = 1))(EAX) © 3x(A!Fe(t) A ay(e(t) A x = 0) = (Ax(x = 1))*. 


Case (e). STR: P(t)* = (Alye(t) A Ay(e(t) A Pt) = Alye(t) A Ay(e(t) A 
Et) (by STR) = (£Et)*. One similarly proves (E( ft))* > (Et)*. (E(9;t))* 
— (Et)* follows from 7.11(iv). 


Case (f). REPL: (Ps)* A (s = 1)* = [B!¥’e(s) A A¥"(e(s) A Ps) A 
APY (e(s) Ate(t) A AVP (e(s\A s = D> [Aly’e(s)' A Aye(1)| A 
3P(e(s) A Ps A Ay(e(t) A s= 1) (7.8(ii), (iv) = (Pr)*. Quite similarly 
one proves 


(Efs)* a (s = 1)* > (fe = ft)* 
The remaining troublesome case is 
REPL(9,) = E(g;s) As =t > 9,5 = g;t. 


Observe that Es A s=t— Et and A(s,y,)As=t— A(t, y,) are in 
fact provable in L*(p) \ REPL(q;). Now [(E(9;,5)* A (s = t)*] © 
[4!y,(A(s, y)* A CEs)* A (s = 1)*)] (by 7.11(ii)) = [3!9,(A(s, )* A 
(Es)* A Aly, A(t, yi)" A (ED)* A yy (ACs, Yi) A ACL Y)* AY, = YO 
[Al yje(p,5) A Alyje(pjt) A Ayy(A(s, ¥)* A (Es)* A ACL, ¥)* A Er)* A 
y= a) = (95 = g;t)*. 

(iii) Completely similar. Use (Ex)* @— Ex to deal with the quantifier 
rules and axioms of L. O 


7.13. Syntactic proof of theorem 7.5. Any proof in T + DESCR is ac- 
tually carried out in a finite subsystem, where the descriptions 
Ix. A,(x), Ix.A,(x),..- used may be assumed to be such that 4,(x) 
contains only /x.A;(x) for i <j. Then, identifying each Ix. A,(y, x) with a 
function 9,(¥), we may think of the finite subsystem as being obtained by 
successively adding function symbols @,, 92, 93,... with axioms AX(A, q,); 
the AX(A, g;) coincide with DESCR for the corresponding description 
terms. O 
Observe that we now also have a syntactic proof of 7.6. 
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7.14. The descriptor in languages with function variables. We now consider 
languages which have function variables a, 8, y,... besides individual 
variables, with quantifier rules and axioms based on H,-IQCE*, i.e. 


Wad > A[a/B], Ala/B] > Jad; 


A> B=A > VBBla/B]; 


BA = 3BB[a/B] > A. Je CIN Ae ho Peeve), 


We consider logics LFT and LFP, based on L, with function variables for 
total and partial functions respectively. For LFT we add to two-sorted L 


FAXT = [WxE(t(x)) > daVx(ax = t(x))] A VaVxE(ax) 
and for LFP we add to L 
FAXP = aVxy(t(x) =yp @ ax=y). 


In the presence of a description operator we find, taking Iy. A(x, y) for 
t(x) in FAXT and FAXP 


AC!  Wx4lyA(x, y) > JaVxA(x, ax) 

in LFT, and 

APC! = FaVxy(Wz( A(x, 2) Oz =y) eax =y) 

in LFP. Instead of adding a descriptor we may equivalently consider 


theories with successive addition of function symbols. 


7.15. THEOREM. 
(i) LFT + DESCR + AC! is a definitional extension of LFT + AC! 
(ii) LFP + DESCR + APC! is a definitional extension of LFP + APC! 


PROOF. (i) We have to extend the definition of —, 6 and e by 
at = at, S(at) = (tr), 6(WaB) = 5(4aB) = 8(B), e(at) == e(t). 


We only have to check 
LFT + AC!  FAXT*. 
In LFT + DESCR — FAXT we can derive Et — d!z(z = t), and thus 


LFT + AC! — FAXT  (Er)* > (3!z(z = 1))*. 
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Now [(VxE(t))*] > [(Wx3!z(z = 1))*] = [Wxdlz(alye(t) A AV(e(t) A 
z =1))) > [BaVx(alye(t) A Ay(e(t) A ax = 1))] (by AC!) = [davx 
(ax = t)*], and hence FAXT* since (VaVxE(ax))* = VaVxE(ax). 

(ii) We have to check 


LFP + APC! + FAXP*. 
Observe LFP + DESCR — FAXP + t=u © Vz(t=z— z=), so 
LFP + APC! — FAXP# (t=u)* © (W2(t=zez=4))*. 


Now FAXP* = [AaVxu(ax =u @ t= u)}* © [JaVxu(ax = u © Vz(t =z 
oz = u)*)] = [FaVxu(ax = uo Wz(Allye(t) A Ay(e(t) A t = z) 
© z = u))] which is an instance of APC! O 


7.16. COROLLARY. Let T (T’) be a theory based on LFT (LFP), plus closed 
axioms and axiom schemas which translate under * into consequences of the 
axioms and axiom schemas. Then T + DESCR + AC! (T + DESCR + 
APC!) is a definitional extension of T(T’). O 


8. Notes 


8.1. There is extensive literature on intuitionistic propositional and predi- 
cate logic. Books in which a substantial part of the contents are devoted to 
IPC and IQC are a.o. Dummett (1977), Fitting (1969), Gabbay (1981), 
Prawitz (1965), Schiitte (1968); the paper van Dalen (1986) gives a survey. 
Further information on intuitionistic predicate and its semantics is to be 
found in chapters 10 and 13. 


8.2. The earliest published complete formalization of IPC and IQC is 
found in Heyting (1930, 1930A) (some historical comments on the signifi- 
cance of Heyting (1930, 1930A, 1930B) are in Troelstra (1978, 1981A). 
Kolmogorov (1925) and Glivenko (1928) contain partial axiomatizations of 
minimal logic. Johannson (1937) contains a formalism equivalent to MQC. 


8.3. Natural deduction. The natural deduction systems NJ and NK for 
intuitionistic and classical predicate logic were introduced by Gentzen 
(1935). Gentzen has both 1 and — as primitives. For 4 Gentzen adopts 
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the rules 
[A] () 
£77) 


an 
aA ale 


In other respects the system is identical with the one presented here. For 
the classical system NK Gentzen has in addition axioms A V 4A. The 
versions used in this book are identical with the systems as presented in 
Prawitz (1965). 


8.4. E-logic and E *-logic. Heyting (1930A) contains an early, only partially 
successful attempt to deal with partially defined terms. 

E-logic as presented here is equivalent to the system for first-order logic 
in Scott (1979); Scott was the first to stress the importance of a logic with 
partially defined terms and existence predicate, in formalizing mathematical 
theories based on constructive logic. 

E*-logic is used, for example, in Beeson (1985), and seems to be the more 
obvious choice of a logical basis if we have to deal with first-order 
axiomatizations of inhabited domains with possibly partial operations, such 
as the combinatory algebras studied in chapter 9. 


8.5. The “negative translation”. The translation * is found in Kolmogorov 
(1925) and is there applied to an incomplete formalization of intuitionistic 
predicate logic. The paper remained almost unobserved for a long time 
(presumably because it was in Russian); it is missing, for example, in the 
otherwise rather complete bibliography in Heyting (1934). The final section 
of Kolmogorov’s paper makes it clear that Kolmogorov realized the possi- 
bility of applying this translation to mathematical theories. For some 
comments on Kolmogorov’s paper, see H. Wang’s “Introductory Note” to 
the English translation of the paper in van Heijenoort (1967). 

Gédel (1933) and Gentzen (1933), independently, gave an embedding of 
classical arithmetic into intuitionistic arithmetic; apparently neither of them 
knew of Kolmogorov’s paper. Gentzen’s translation coincides with the 
translation ® (the choice of ® is to remind us of Gédel—Gentzen) as given 
here, except that the prime formulas are left unchanged. Gédel’s translation 
has for implication (A > By’ = -(A’ A —B’). Gentzen’s paper was writ- 
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ten in 1933, but withdrawn in the galley proof stage when he learnt of 
Gédel’s result. The formulation of the conservative extension result for 
arithmetic corresponding to 3.6 is due to Bernays (in Gentzen 1933). 

Kieene (1952, section 8.1) treats the Gédel and Gentzen versions of the 
translation at length, also for predicate logic, which requires the insertion of 
—— in front of prime formulas. 

For more historical comment see e.g. the introductory note to Gédel 
(1933) in his Collected Works (1986). 

A number of papers in the literature contain extensions of the conserva- 
tive extension result in 3.6, such as Mints and Orevkov (1967, our™3.26(ii)), 
Cellucci (1969, our 3.26(iii)) and Leivant (1971, our 3.20(iv)). Leivant (1985) 
contains another systematization of the conservative extension results, not 
only for predicate logic but also for mathematical theories. Our exposition 
in 3.11—26 is based on Leivant (1985) and (1971); 3.26(iv) is a combination 
of results from both papers. 


8.6. Hilbert-type systems. The term “Hilbert type system” is used for 
example by Kleene (1952), and presumably originates in the side-by-side 
comparison of the calculi NJ, NK, LJ, LK, and LHJ, LHK (“a logistic 
calculus according to Hilbert and Glivenko”) in Gentzen (1935). The 
distinctive feature of the Hilbert-type systems as understood here is that 
they characterize derivability without assumptions by means of axioms and 
rules, in contrast to sequent calculi such as LJ, LK. The name “Hilbert-type 
system” is historically not quite correct as a designation of this type of 
system, since many authors, following Frege, gave axiomatizations of this 
type. 

It is to be noted that the deduction theorem Tr, AF B=>ITtA-B 
can also be obtained under the following more liberal version of VI: 
[T, At C=, At WxC if x € FV(A) (Kleene 1952, section 22: deduc- 
tions with variables held constant w.r.t. the assumption A). 


8.7. Kripke semantics. This semantics (Kripke 1965) for IQC has its roots 
in Kripke’s earlier “possible world semantics” (1963) for modal logic, in 
combination with the well-known embedding of IQC in quantificational 
modal logic QS4 (cf. the introductory note to Gédel (1933A) in the 
Collected Works (1986)). The embedding has recently been extended to 
mathematical theories beyond logic; see e.g. the volume Shapiro (1985), and 
the papers by Flagg (1986), Flagg and Friedman (1986) and Scedrov (1986). 

The proof of the completeness theorem in 6.2—6 is of the Henkin type; 
several authors have given such a proof, e.g. Aczel (1968), Thomason 
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(1968), Fitting (1969), Luckhardt (1970). Luckhardt’s proof is related to the 
proof given by Schiitte (1968, section 15) for IPC; see also Troelstra 
(1978A) and E2.6.3. 

Unterhalt (1986) studied very thoroughly the completeness of E-logic and 
E*-logic relative to Kripke semantics. The interpretation indicated in 5.14 
for E-logic is not strongly complete as the following example of Unterhalt 
shows. Consider the open theory { Ex} (the theory of global existence). If % 
is a model of { Ex} then Vd € DVk(k |+ Ed), hence Vd € Dvk(d € D(k)), 
which makes D a constant function; but in Kripke models with constant 
domain (VxR(x) V P) @ VWx(R(x) V P) holds, (E2.5.6) while this schema 
is false for Kripke models with variable domains (5.9). 

E-logic is slightly more general in permitting empty domains, which may 
be an advantage if we are working in a many-sorted theory in which some 
of the domains are possibly empty (e.g. type theory in the version described 
in 3.9.5-8). 


8.8. Our sources for section 7 are Kleene (1952), section 74, and Renardel 
de Lavalette (1984) (also as chapter 1 of Renardel de Lavalette 1984A). The 
latter source also contains references to the literature. The syntactic method 
of proof (7.5~13) extends the syntactic treatment in Kleene (1952). 


Exercises 
2.1.1. Construct natural deduction trees (without short-cuts) showing the derivability of 
(A > B) > (4B > =A); 
3A(4 > B) - (A> 5B); 
an(A A B) @ 4 AB, (AA AaB) & (A V B); 
[4>((Bv (4>0))] > [44 BVO}; 
[A> (B>C)] 6 [(4> B) (4 >0)]; 
(AN B>C)e (A> (B-C)). 
2.1.2. Construct natural deduction trees (without short-cuts) showing the derivability of 
33xA(x) @ Wx3A(x); 


AVx-4A(x) 7 A49xA(x); 
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AV WxB(x) > Wx(A V B(x)) (x € FV(A)); 
3xA(x) V 3yB(y) @ Ax(A(x) V B(x)) (x € FV(B)): 
AxA(x) A 3yB(y) © axdp(A(x) AB(y)) (x € FV(B), y € FV(A)): 
(A > WxB(x)) 6 Wx(A > B(x)) (x € FV(A)): 


(AxA > B) @Vx(A > B) (x € FV(B)). 


2.1.3. Give a deduction tree for (A - —.B) - (-4A > —=-8B) without using derived rules 
such as R,R’ in 1.5. 


2.1.4. Prove, for A with x € FV(A): 


(A V AA) A (A V WXB(x)) @ Vx((A V AA) A (A V B(x))). 


2.1.5. A formula is said to be prenex if it consists of a quantifier-free formula preceded by a 
string of quantifiers. A formula A majorizes B iff 


COCK ASB, IQCHA OB. 


Prove the following theorem: for each A in the language of IQC we can find prenex A), A, 
majorizing A,—A respectively. Hint. First establish a lemma: let A, B be prenex or negated 
prenex; then there exists a prenex C majorizing A V B. This lemma can be established by 
induction on the number of quantifiers in A Vv B; at one step one uses the logical law of 
E2.1.4, Finally establish the theorem itself by induction on the complexity of A. 


2.1.6. Show that (cf. 1.10) 

(i) each formula of IPC constructed from a single proposition letter P is in fact logically 
equivalent to an A,(P) (n © N) or A,(P); 

(ii) between the A,, implications hold as indicated in the diagram of 1.10. 


2.1.7. Show that VE, VI, JE, 3I can be derived from the rules for L.A, — ,V in CQC if we 
define V,3 as in 1.6. 


2.1.8. Show that the system with axioms for equality is equivalent to the system with rules in 
subsection 1.8. Moreover, show that the replacement rule is derivable from its atomic cases 


Rus) cet bat t=" =e 
R(s.t',5’) (Ber =fGes’) t=" 


(R a predicate symbol, f a function symbol of the language). 
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2.1.9. Show that 1, is derivable from the instances of 1, with atomic conclusion. 


2.2.1. Show that for E-logic we have: if E* is another unary predicate obeying the same laws 
as E, then for all terms ¢ 


E*t© Et 

is derivable. Show also 
3ax[Ex A A(x)] @ 3xA(x), 
Vx [Ex > A(x)] @ VxA(x) 


(Scott 1979), 


2.2.2. Derive inIQCEr=asAt=sror=st=sos=tt=sAtsesrt=v; 
t3=s7s=t 

f is total iff Ex, A +++ AEx, > E({x). Show that f=5— ff = fS, as a schema, is 
equivalent to (2) of 2.6 + f is total. Prove (1) in 2.4 (Scott 1979). 


2.2.3. Give deduction trees for the derivations in the proof of 2.7. 


2.2.4. Let FV(A) € {x,,..., x, }. Show 
IQCE* A(x,,....%,) @ IQCE + 3xEx A Ex; A +++ AEX, 7 A(X. Xn)> 


(Renardel de Lavalette 1984, 1984A, 1.3.3(i)). 


2.2.5. Prove in IQCE the following properties of the descriptor: 
Ix. Ax = Ix. Bx @ 3y[Wx(Ax > x = y) AVx(Bx ox =y)]; 
Ix. Ax = Ix. Bx @ Vy[Wx(Ax @ x =y) AVx(Bx eo x =y)]; 
Wx(Ax @ Bx) > Ix. Ax = Ix. Bx 


(Scott 1979). 


2.3.1. Prove proposition 3.8. 
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2.3.2. Let B+ be obtained from B by replacing all prime formulas P in B, other than 1 , by 
PV 1. Then show IQC+ Be Bt and —IQCk B>MQCE B+. Hint. Show first by 
formula induction MQCF 1 At. 


2.3.3. Prove Glivenko’s theorem for propositional logic: 
CPCt A @ IPC —AA. 

Let DNS (“double negation shift”) be the schema Vx A(x) — 4WxA(x) and show 
CQCr B # IQC + DNS+ —B. 

(Glivenko 1929, Gédel 1933). 

2.3.4. Complete the proof of 3.12. 

2.3.5. Complete the proof of 3.18. 

2.3.6. Complete the proof of 3.20(i), (iii). 

2.4.1. Prove theorem 4.3. 

2.4.2. Prove theorem 4.4. 

2.4.3. Complete the proofs of 4.6 and the theorem in 4.7. 


2.4.4. Another Hilbert-type system is based on the following axiom schemas (Gédel 1958) 
AVA->A, A>PANA, APAVB, ANB->A, 
AVB>BVA, ANBOBAA, 1A, 
VxAx > At, At— IxAx 


and the rules VI,, JE, of 4.3, > E, and 


A>B A>B B>2C AAB>?C A>(B>9C) 
CVA>CVB A>C A>(B3C) AABSC 


Prove the equivalence with one of the other systems. (See e.g. Troelstra 1973, section 1.1.) 
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2.4.5. Show that the Hilbert-type system based on the following axioms and rules (Spector 
1962) is equivalent to one of the others. The axioms are instances of 


A>A, ANB>A, AAB>B, A>PAVB, BoAVB 
L—-A, WxAx > At, At—- AxAx 
and the rules are — E, VI, AI, (cf. 4.3) and 


A>B B-C A>~C Bw>C A>BA>C 
A>-C AVB>-C A>BAC 


ANB>2C A>(B>C) 
A>(B>C) AABOC 


(see e.g. Troelstra 1973, section 1.1). 


2.5.1. Construct countermodels to the following formulas 
aPv —4P; 
anP > (=P Vv P); 
(P> (QV R)) > ((P> QO) v (P= R)); 
[((P > @) >) > ((P’ > 8) > @)) > [((P > P') > @) > QI. 
It is to be noted that here as well as in the next exercise the same model may sometimes serve 


to refute several formulas. 


2.5.2. Construct countermodels to 
Wx Ax > aAVxXAX; 


an(VxAnAx > =AVxAx); 


a735xAx - 3x-=7Ax. 


2.5.3. Let Fo(Po, Pi) = ((Po > Pi) > Po) > Por Fua1(Por--++ Pra2) = 
Fy(Po, F,(Py.-+-> Py+2))- Construct Kripke models %;,, # such that 
(i) Forno n is F, valid in #, 

(ii) In 4% F, is refuted and F,,,,, holds for all k EN. 
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2.5.4. Let (K, <) be a finite partially ordered set such that the collection of upwards 
monotone sets (sets of nodes X such that Vk © XWk’> k(k’ € X)) contains at most n 
elements; then 


Em V (2°28) 


i<jsn 


holds in any Kripke model (K, < , IF). Prove this, and deduce that there is no finite Kripke 
model .¥ such that for all A IPC + A = #1 A. 


2.5.5. Let (K, <) be a fixed partial order such that for all Kripke models (K, <, IF) the 
formula (P ~ Q) V (Q > P) is valid. Prove: if K has a root, then (K, <) is linearly ordered; 
in general K satisfies Vxyz(x < yA xs z7>y<2Vz<y). 


2.5.6. A Kripke model with constant domain is a Kripke model = (K, < , I+ ,D) such that 
for all k © K D(k) = D. Show, reasoning classically, that in such models, if x € FV(A) 


Ik Vx(P V R(x)) > (PV VxR(x)). 


2.5.7. Complete the proofs of 5.3 and 5.10. 


2.5.8. Show that the conditions listed at the end of 5.11 suffice to recover a Kripke model as in 
the definition. 


2.5.9. Complete the proof of the proposition in 5.13. Reformulate the model with restrictions. 
2.5.10. Extend the proof of the soundness theorem to the full language and.to E*-logic, 


2.6.1. We call a pair of sets of sentences (I, A) complete, if for each sentence A, AE ITUA. 
If (I, A) is complete and consistent, I has the disjunction property. Show: Each consistent 
(I, 4) has a complete extension (I’, A} with Pc I’, A c A’, I’ C-saturated where C is a set 
of new constants. Formulate a corresponding generalization of the completeness theorem. 


2.6.2. Modify the construction of F° in the proof of 6.3 as indicated in the remark. 


2.6.3. Yet another modification of the construction of I in 6,3 is as follows. Let (A,,), be an 
enumeration of all sentences of £(C), with 4) = A. We define simultaneously I‘, F,, Gy by 
induction on k: put IF =I, Go = Ao, Fy any tautology. Assuming I‘, F,, G, to have been 
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constructed, we define **!, F, 4 5,Gy41! 
Case 1. FE GV Aga 


Fy. Fey TRY SK, Gy = GV Aga: 


Case 2. TF HG, Vv Ansys Aga, = IxB(x). Put 
ret} = Tk UY (AxB(x), B(ege))}, 


Fay = Fy A AXB(x) A BC ego). Gear = Gs 


Cece) first constant not in I* or G,. 
Case 3. TK UE GV Anais Aas = By V By. Put Pett = TAU (BV BB}, Fea = 
A B,, Guay = Gy, i the least of {1,2} such that Pr (FF, A B > G,). 


Case 4. If 1-3 do not apply, let P**! = F* U {Aya}, Gear = Ges Peay = Fy. 
Show that °° is C-saturated, and (for readers familiar with the basic notions of recursion 
theory) that °° is recursive in I. 


2.6.4. Show that for theories with decidable equality, (i.e. Vxy(x = y V x # y) holds) we have 
completeness w.r.t. normal Kripke models where = at each node k is interpreted by equality 
in D(k). Hint. Consider quotient structures obtained by taking equivalence classes 
modulo ~, . 


2.6.5. Carry out the completeness proof for IQCE* (6.7) in full. 


2.6.6. Let {.%, :i © I} be a collection of propositional Kripke models (K,, <, , (+; ) with 
disjoint K;. We put 


Ux=(UK.U <,.U m). 


iel ier iel req 


For any Kripke model (K, <, -) = let 4’ = (KU {kg}, <’, IF’) where ky € K, 
s'!K=<,k2kg forall k € K, kg %’P for P prime, Ir’ restricted to K coincides with 
Ir. 

Let C be such that no strictly positive subformula of C is a disjunction or an existential 
formula (cf. 3.23). Show that if for all i € J %; Ir C, then (U%j)’ IFC. 


2.6.7. Use the preceding exercise to prove (classically) the following strong version of the 
disjunction property for IPC: if C satisfies the conditions of the preceding exercise, and 
IPC+ CA v B, then IPC+ C—> A or IPC+- C > B. Hint. Argue by contraposition, use 
completeness. 
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2.6.8. Prove Glivenko’s theorem (E2.3.3) by means of Kripke models. Hint. Show that for a 
theorem A of CPC at each node of a finite Kripke model -—A must hold (use 6.11). 


2.6.9. Show that Kripke models over finite binary trees are sufficient for completeness for 
propositional formulas without V (Segerberg 1974). 

Hint. Let (K, < , IF) be any finite tree model, k © K, ky > k, k, > k immediate successors 
of k, Define (K’, <’, i’) such that K’ = KU {k’} (k'€K), <’t K=<, k’>’k, 
Wk" © K((ky < k” or ky < kh") @ k’ <’k") and for prime P 


VkKEK(ktH’Pektt P), 
(1) 
k’ IK’P & (ko Ik P and k, Ik P). 


Show that (1) extends to arbitrary formulas. 


2.6.10. A surjection from a poset (K, <) to a poset (K’, <’) is called a p-morphism (or 
strongly isotone mapping) if f satisfies (i) x < y > fx <’fy and (ii) Vx © K’Vy € K(fy <’x 
> 32 €K(y <2 A fz = x)). Let #= (K, <, IF), #’ = (K’, <’, Ik’) be two Kripke mod- 
els, and f a p-morphism from (K, <) to (K’, <’) such that k Ik A iff fk IF’A for atomic A; 
prove that this holds for all A. 


2.6.11. If is a collection of posets such that each finite tree can be obtained as the image of 
an element of under a p-morphism (see the preceding exercise), then IPC is complete for 
Kripke models over posets from .. In particular, show that IPC is complete with respect to 
Kripke models over the infinite binary tree. 


2.6.12. Let ., = (K,;, <;) be a poset for i<n, and define m% X --+ X#%_) 
as (Ko X°--: XK,_1, <’) where (ko,....k,-1) S$‘(ko.-.+. khu1) @ Vi<n 
(k; <; kj). If x= (K, <) is a poset, we let 2/7’ = (K U {k*}, <’) be the poset (modulo 
isomorphism) such that k* € K, k <’k’ iff k < k’ for all k,k’ © K, k* <k for all k EK. 
Now define a sequence of posets as follows. 2% := the singleton poset with trivial ordering; 
A, = (Hy) X Dy); DH.) = (HM, X +++ XH,)'(n + 2 factors). Show that IPC is complete for 
Kripke models with trees from the sequence (.%,),,. (This is virtually Jaskowski’s result (1936); 
cf. also Smorynski 1973A, or Smorynski in Troelstra (1973, 5.3.8).) 


2.6.13. Give a completeness proof for IPC + LO with respect to linearly ordered Kripke 
models, where LO is the axiom schema (A > B) V (B = A). Hint. Suppose IPC + LOK F, 
then a fortiori IPC + Y F, where Y= {(A > BV (B— A): A and B subformuas of F}. 
Extract from a tree-countermodel for IPC + Y F a linear countermodel. 


2.6.14. Let us call a formula positive if it is constructed from the function symbols and 
relations of the language using 1 ,A,V,3, =. A formula is geometric if it is of the form 
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Yx(A — B), where A and B are positive. Let A, (k <n) and B be geometric; show that if 
Fk. A{A, 2k <n} > B, then +, A{A, 2k <n} > B. Hint. Observe that positive formulas 
are forced at a node of a Kripke model iff they are classically valid at that node. 

2.7.1. Prove 7.2. 

2.7.2. Prove 7.3. 


2.7.3. Prove 7.8, 7.11. 


CHAPTER 3 


ARITHMETIC 


In the present chapter we treat intuitionistic /constructive arithmetic at 
some length. The first five sections are elementary in character, the rest 
mainly contains material needed in chapters 9-15. 

For the reader who primarily wants an introduction to constructive and 
intuitionistic mathematics, a brief perusal of the sections 1, 3, and 4 suffices. 
The result in 3.4 explains why intuitionistic elementary arithmetic is little 
different in character from classical elementary arithmetic: the vast majority 
of well-known theorems of classical elementary number theory can be 
stated in IIé-form and are therefore also intuitionistically provable. 


Summary of the contents. Section 1 introduces the class of primitive recur- 
sive functions PRIM and lists certain of its closure properties. 

Section 2 describes primitive recursive arithmetic PRA, a system without 
quantifiers, and illustrates how some basic theorems of arithmetic can be 
established in this system. This section is not needed for the next section on 
intuitionistic first-order arithmetic HA. 

Section 4 presents a brief analysis of the informal notion of algorithm 
and deduces from this analysis some basic properties such as the smn-theo- 
rem and the recursion theorem. The contents of this section are needed in 
sections 4.3 and 4.4, and in later chapters where examples of constructive 
recursive mathematics are given. 

Section 5 gives some examples of metamathematical properties of HA 
such as the disjunction and existence. property, closure under the “indepen- 
dence of premiss”-rule, and closure under Markov’s rule. 

Sections 6 and 7 contain an introduction to elementary inductive defini- 
tions and give an outline of the formalization of elementary recursion 
theory in HA and elementary analysis EL. This material is needed in 
Chapter 9, and also to provide a rigorous backing to the proof sketch of the 
characterization theorem for realizability in section 4.4. 
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Section 8 is devoted to intuitionistic second-order logic and arithmetic 
with set variables, and section 9 discusses the corresponding generalization 
to all finite types. 

Though the sections dealing with elementary recursion theory, namely 1, 
4, 6 and 7 are in principle self-contained, the treatment is condensed and 
some preliminary acquaintance with the basics of recursion theory will be 
helpful. 


1. Informal arithmetic and primitive recursive functions 


1.1. As already noted in 1.2.1, natural numbers are generally regarded as 
unproblematic, and are treated as basic constructive objects all of the same 
kind (except from the ultra-finitist point of view mentioned in 1.2.1). Thus 
the “Peano-axioms” are regarded as obviously true, i.e. if S is the successor 
function then 


x=y @ Sx = Sy, 
0 # Sx, 


and induction is accepted for all constructively meaningful properties of 
natural numbers A(x): 


A(0) A Wx( A(x) > A(Sx)) ~ Wx A(x). 


The justification of induction is based on the mental picture we have of the 
natural numbers as obtained by successively adding “abstract units”; given 
A(0), Wx( A(x) — A(Sx)) we build parallel to the construction of n E Na 
proof of A(n): 


A(0)_ A(0) > A(Q1) 
A(1) A(1) > A(2) 


A(2) ... 
etc. 


Number theory uses many operations besides successor, but also for 
metamathematical purposes one needs at least addition and multiplication. 
However, it is extremely convenient to have a more ample supply; for this 
we shall take the class of primitive recursive functions PRIM, each primi- 
tive recursive function will be quite obviously constructively well-defined. 

The remainder of this section will be devoted to the definition and 
properties of PRIM. 
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1.2. DEFINITION. The class of primitive recursive functions, PRIM, is gener- 

ated by the following clauses 

(i) Z,S, p? for i <n, n > 1 belong to PRIM. Here Z is the zero-function 
satisfying Z(x) = 0, S is the successor function, and p? is a projection 
(-function) determined by p?(Xo,.--, X,—-1) = X;- 

(ii) PRIM is closed under composition: if f, g, € PRIM, with fe N" > N, 
g, © N* >N(1 <i< hk), then there is an h © PRIM satisfying 


h(x) = f(g,(X),..., 8,(%)). 


(iii) PRIM is closed under definition by recursion, ie. if fEN"-N, 
g © N"*? SN, then there is an 4 € PRIM such that 


ee X) = f(x) 
h(Sy, X) = g(h(y, ¥), y, X). O 


By “PRIM is generated by (i), (ii) and (iii)” we mean that PRIM is the least 
class satisfying (i), (ii), (iii). We may think of all primitive recursive 
functions being numbered, as @o, 9), P2,... Such that each ¢, is either one 
of Z, S, pi, or defined from certain g,’s with j < k; properties of PRIM 
may now be established by induction on k (N.B. repetitions may occur). 

The reader who is familiar with elementary recursion theory may skip the 
rest of this section. We shall now introduce some familiar arithmetical 
functions as elements of PRIM. 


1.3. Some primitive recursive functions. 
Constant functions are primitive recursive. Thus for fixed m the fe N” > N 
satisfying 


f(x)=m 


is in PRIM: if m = 0, take f(x) = Zpg(x), and for m = Sm’, given g with 
g(X) = m’ for-all x, let f(x) = S(g(x)). 

Addition is usually given by x +O=x, x + Sy = S(x + y). Writing 
+(y, x) instead of x + y we can bring this in the standard form prescribed 
by definition 1.2: 


+(0,x) =ph(x); +(Sy, x) = S(p3(+(y, x), y. x)). 


After this example of a definition in standard form we assume that the 
reader can provide detailed formulations, if desired. 
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Multiplication 
x-0=0, x-Sy=x-ytx. 


Exponentiation 


Predecessor 
prd(0) =0, prd(Sx) =x. 
Cut-off subtraction 
x+O=x, x + Sy =prd(x + y). 
Note that x — y = Oiff x < y, x +> y = x — y otherwise, ie. if x > y then 
yt(x-y)=x. 
Absolute difference 
Pome (ey) lye): 
Signum function sg and inverse signum function sg’ 
se’(x) =1—+x, sg(x)=1+(1 +x). 
So sg(x) = 0 iff x = 0, and sg(x) < 1 for all x. 
Maximum and minimum 
max(x, y)=x+(y= x), min(x, y)=x=(x = y). 


The properties listed below in 1.4—1.14 are presented without proof; these 
may be regarded as exercises, or looked up in a standard text on recursion 
theory. 


1.4. PROPOSITION (closure properties of PRIM). Let o be a permutation of 
{1,...,} and let g © PRIM. Then also f © PRIM in the following cases 
(i) f(X) = 8(Xeays-+ +s Xony) (permutation of arguments), 

(ii) f(X) = 8(%4, x) (identification of arguments); 

(iii) f(y, = g(X) (dummy arguments); 

(iv) f(z, x) =H, <.8(), x) (finite product) with 


Te.9=1 80.2) = 82 2)-(T0.9)}; 


y<z 
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WY): SGX) = Eye 205 x) (finite sum) with 
Lalve=0 Y s(vX)=8(e.3) +( Da.¥)}; 
y<O0 y<Sz y<z 

(vi) f(z, ¥, ¥) = 27(y, X) (iteration) where 


8°(y, X) = poy, ¥), 8(y, ¥) = 9(87(y, X), ¥). 


(vii) f(z, x) = min, <,(g(y, X) = 0) (bounded minimum operator) where 


min, <.(g(y,%)=0) = ¥ se I] s(y¥)). 


u< Sz y<Su 


Thus min, <,(g(y, x) = 0) is the least y <z such that g(y,xX) =0, if 
existing, z + 1 otherwise. O 


1.5. DEFINITION. A relation R is said to be primitive recursive, iff its 
characteristic function x p, satisfying x p(X) < 1, xp(X) = 0 © R(X), be- 
longs to PRIM. O 

EXAMPLES. = , < are primitive recursive since 


Xu (x, ¥) = se(lx — yl), x2 (x, y) = se(x-y). O 


1.6. PROPOSITION. The primitive recursive relations are closed under intersec- 
tion, union, complementation and bounded quantification, and under substitu- 
tion: if R(x, y) is primitive recursive, then so is R( f(x, ¥,Z ), Y) for any 
f¢PRIM. oO 
1.7. DEFINITION. If R(y, X) is primitive recursive, we define 

min, <,R(y, ¥) = min, <.(xXe(y,¥) = 0). O 


EXAMPLES. The following predicates and functions are primitive recursive 
x|y = 32 <y(x-z=y) (divisibility); 
Prime(x) = Vy <x(aylx Vy =1)Ax#0AxX#1 (x isa prime); 
0! = 1, (Sx)! = (x!) + Sx (factorial function); 
p(Sx) = min, < y¢xy41[Prime(y) A y > p(x)], p(0) = 2 
( p(n) is the nth prime). 
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1.8. Coding of p-tuples. In arithmetic we can talk about p-tuples of natural 
numbers and finite sequences of natural numbers via suitable codings. For 
the coding of pairs we need a mapping j € N? > N with inverses j,, j, 
satisfying 


Ada yl =x, fyi(x, y) =y. 


It is often convenient, though not strictly necessary, that the pairing is 
surjective, i.e. 


HAZ. oz) = 2. 
From j, j;, /. we can now construct codings v? for p-tuples with inverses 
jf such that 


PONG cre hy ma; USi <p): 
To achieve this, we may put for example 

v'(x9) =X, 

BP Ses Kv anspne) = Ld 6 eee Fae ilxpais* )). 
It is obvious how the j? are to be defined from j, and j,. O 

It is easy to see that if j is surjective, so are the v” for all p. There are 

many possible choices for j, a convenient one is 
1.9. DEFINITION. 

I(x, y) = 2*- Qy +1) = 1, 

Az= min, .,[3y < 2(2*+ (2y + 1) = Sz)], 

jyz = min, .,[4x < z(2*-(2y +1) = Sz)]. O 


Another well-known pairing function is j’(x, y):= 3(x + yx +y +1) 
+ y. 


1.10. PROPOSITION. j as defined above is a surjective pairing with inverses 
Ji» Jz which satisfies in addition monotonicity conditions 

@) x <j(x, y), ¥ Si(x, Y), 

(ii) x <x’ > (x, y) <j(x', vy) ALY, x) <I, ¥); 

Gii) 70,0) = 90. O 


1.11. DEFINITION (coding of finite sequences). If j is a surjective pairing 
with inverses j,, /,, such that (0,0) = 0, and the v?’s are the p-tuple 
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codings defined from j, we can code arbitrary finite sequences xy,..., x 
by a number (xo,..., x,,) such that 


u 


{) = 0 (zero codes the empty sequence), 


(Xoyeees Xu) = j(u, PEN xh) +1. 


It does not make sense to describe this coding as primitive recursive, since it 
does not have a fixed number of arguments; however, it can be completely 
described from the following operations: the concatenation function * such 
that 


(Xoye +s Xu) * (Kus ayes Kato) = (Kove ss Kuro)» 
the /ength function Ith such that 
Ith) = 0 
Ith(x,...,x,) =ut], 
and the decoding function * € N? > N such that 
m(n,y) =x, ifn = (xo,...,x,) and y < Ith(n) 
a(n, y) = 0 in all other cases. 
We usually write 
(n), fora(n,y). O 
*, lth and 7 are primitive recursive in j (exercise). We put 
n<m:=4n'(n*n' =m), n&m'=msn, 
n<m:=nxmAn#m, n>~mi=m~<n., 


1.12, PROPOSITION. 
(i) Ith(n * m) = Ith(n) + Ith(m), 
(ii) (n*m), = (n), for y < Ith(n), 
(n*m), = (M),-1ncn) for y 2 Ith(n). O 


1.13. DEFINITION (course-of-values function). For any function » we define 
@ as 
p(0,m)={), G(Sn,m) = 9(n,m)*(p(n, m)). O 


1.14. PROPOSITION (course-of-values recursion). If x and are primitive 
recursive, so is p given by 


p(0,x)=x(X), p(Sz, ¥) = ¥((Sz, x), 2z,¥). O 
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2. Primitive recursive arithmetic PRA 


2.1. The usual presentation of number-theoretic arguments may create the 
impression that the full force of first-order logic, and in particular the use of 
quantifiers, is necessary. Therefore it comes more or less as a surprise that a 
considerable fragment of arithmetic can be handled in a quantifier-free 
fragment. In this section we illustrate this by developing some arithmetic in 
the quantifier-free part, PRA, of intuitionistic arithmetic HA. The contents 
of this section are not needed later on. 

The language of PRA contains free variables for natural numbers, 
constants 0 (zero), S (successor), a function symbol for each primitive 
recursive function, and the binary predicate = (equality). The axioms and 
rules are the axioms and rules of IPC with equality (with respect to the 
language of PRA), the axiom 


+S0 = 0, 
the induction rule 
A(0), A(x) > A(Sx) = A(t) 
for quantifier-free A, and the defining equations for all primitive recursive 


functions. 
In particular, we have constants prd, ~, +, - satisfying 


prd(0) = 0, prd(Sx) = x; 
x+O=x, x+Sy=prd(x-y); 
x+0=x, x+Sy=S(x+y); 
x'O0=0, x: -Sy=x-ytx. 

The equality axioms yield 
Sx = Sy > prd(Sx) = prd(Sy) > x =y, 


so we do not need Sx = Sy > x = y as an axiom. 
In the absence of quantifiers, we need a substitution rule 


A(x) 
A(t)’ 


where A(x) has been derived from hypotheses not containing x free. We 
can avoid the introduction of a substitution rule by stating all axioms as 
schemata, arbitrary terms replacing the variables (e.g. + 0 = 14, 1+ St’ = 
S(t + t’) for addition). 

In a natural deduction formulation we can take besides axioms tf = f, and 
defining equations for primitive recursive functions the following rules 


SUB 
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[4(x)] 
0=S0 th=t, A(t) A(0) A(Sx) 
L A(t>) A(t) 


In the third (induction-) rule the assumptions indicated by [A(x)] are 
discharged. Actually, ,SO = 0, or the corresponding natural deduction 
rule, is redundant since we have 


2.2. PROPOSITION. is definable in PRA as 0 = SO. 


Proor. Let @ be the primitive recursive function such that 


(ty. t2,0)=t,, p(t. t), Sx) =h. 

Then 0 = SO > ¢, = 9(f,, f,,0) = p(t), tp, SO) = ¢, and thus 
0=S0->-P 

for all prime formulas P; by an easy formula induction we then establish 
0=S0-A 

for all quantifier-free A. In the induction —B is to be interpreted as 


B>0=S0. O 


REMARKS. This proposition shows that in the absence of 1 as a primitive, 
0 = SO can play the role of 1 . However, if 1 is present as a primitive, 
0 = SO > 1 cannot be proved unless we postulate it. 

Observe also, that for quite general logical reasons, the equality axioms 
all follow from reflexivity, symmetry, transitivity and the following special 
cases of substitution. 


Xp = XEN XQ = XANAX, =H XL 
P(X yp <525- Xe) SMe KD 


for all function constants 9. 

Finally note that, for any term f(x) in the language of PRA, there is a 
primitive recursive function g such that ¢(x) = p(x) holds in PRA. 

Our next aim is to generalize the induction schema. 


2.3. PROPOSITION (special induction). The following schema holds in PRA 
A(0), A(Sx) = A(t) 
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Proor. Immediate from the induction schema. O 


2.4, PROPOSITION. 

Gi) O-x=0; 

(ii) Sx + Sy=x-y; 

(iii) Sx — x = SO; 

(iv) x —~ x =0; 

(v) x =0V x = S(prd(x)); 

(vi) x #0 © x = S(prd(x)), x = Sz > x = S(prd(x)). 


PROOF. (i) 0 + 0 = 0 (defining axioms for ~ ), and 0 + Sx = prd(O ~ x) 
= prd(0) = 0 (under the induction hypothesis 0 ~ x = 0); hence 0 + x = 0 
by induction. 

(ii) Sx — SO = prd(Sx = 0) = prd(Sx) = x = x + 0; assume Sx — Sy 
=x -— y, then Sx + SSy = prd(Sx + Sy) = prd(x + y) = x ~ Sy; induc- 
tion yields (ii). 

(ili) SO~ 0 = SO; SSx + Sx = Sx + x = SO (with (ii) and induction 
hypothesis Sx + x = $0). Induction gives (iii). 

(iv) x — x = Sx + Sx = prd(Sx + x) = prd(S0) = 0 (by (ii), (iii). 

(v) and (vi) are left to the reader; for (v) one uses induction. O 


As an illustration of the process of complete formalization we exhibit a 
completely formalized proof of (i), based on the logical rules in section 2.1 
and the axioms and rules of subsection 2.1. 


dq) y+0=0 (defining axiom for ~ ) 
(2) 0-0=0 (SUB, (1)) 
(3) y- Sz =prd(y-z) (defining axiom for — ) 


(4) 0- Sz = prd(0- z) (SUB, (3)) 
(5) 0- Sx = prd(O — x) (SUB, (4)) 


(6) 0-x=0 (hypothesis) 
(7) prd(0) = prd(0) (REFL) 
(8) prd(O ~— x) = prd(O) (REPL, (6), (7)) 
(9) prd(0) = 0 (defining axiom for prd) 
(10) prd(O + x) =0 (TRANS, (8), (9)) 
(11) 0-Sx=0 (TRANS, (5), (10)) 
(12) 0-x=0 (ind. rule, (2), (11), elim. of (6)). 


It is to be noted that the proof of 0 = SO > A in 2.2 above is not an 
(informal) argument in PRA; rather, it is a metamathematical argument 
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showing how to generate proofs of 0 = SO — A in PRA for all formulas in 
the language of PRA. 


2.5. PROPOSITION. Let t,t), t, be terms; then the following rule holds in 
PRA 


A(0, y), A(x, t(x, y)) > A(Sx, y) > A(t, 4). 


PROOF. It is enough to establish the rule in the form 


A(0, vy): A(x, p(x, y)) > A(Sx, y) =. A(t, th), 


for p a primitive recursive function symbol (cf. remarks in 2.2). Assume 
A(O, y) and A(x, p(x, »)) > ACSx, y) and define by recursion f such that 


¥(0) = ty), H(Sz) = pty + Sz, o(z)). 


If Sx =f) — z, then ty + Sz = prd(tp ~ z) = prd(Sx) = x, hence, since 
A(ty — Sz, P(ty + $2, ¥z) @ A(ty — Sz, ¥(Sz)) and A(x, p(x, y)) > 
A(Sx, y), 


Sx = ty +z (A(t) + Sz, ¥(Sz)) > A(to +z, ¥z)]. 
Since A(0, (z)), also 

0 = tg 2 [A(ty + Sz, ¥(Sz)) > A(to + z, ¥2)], 
and therefore by special induction (2.3) 

to — 2 = ty =z [A(ty = Sz, ¥(Sz)) > A(ty + z, ¥z)], 
hence 

A(to + Sz, W(Sz)) > A(ty — z, Wz) 
and therefore 

[A(t + 2,42) > A(t, 4)] > [A(to = Sz, ¥(Sz)) > A(to, 4)I- 
Also 

A(ty ~ 0, YO) > A(to, t), 
and so by induction 

A(ty = to, ¥(to)) > A(to, 41); 
and thus A(t, 4). O 
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2.6. PROPOSITION. The following form of double induction holds in PRA: 
A(x,0), A(O, vy), A(x, y) > A(Sx, Sy) = A(to, th). 


Proor. Assume A(x,0), A(0, y), A(x, y) ~ A(Sx, Sy) to be derivable; 
y = Sz > S(prd(y)) = y (2.4(vi)), and so 


y = Sz > (A(x, prd(y)) > A(Sx, y)); also 
y =0 > (A(x, prd(y)) > A(Sx, y)) 

(since A(Sx,0) by hypothesis). Therefore by 2.3 
y=y > (A(x, prd(y)) > A(Sx, y)): 


now detach y = y and apply the preceding proposition. O 


2.7, DEFINITION. 
x< y= (x+y =0), 
x<y=(Sx+y=0). O 


2.8. PROPOSITION. 

(i) x=yVx#¥#y; 

Gi) xs yAysx>-x=y; 
(ili) x Sy Vy <x; 

(iv) x<yox=ypVx<y; 
(Vv) x=yVx<yVy<x; 
(vi) =axSyoy<x. 


ProoF. (i) Apply double induction. x = 0 V x #0 follows by special 
induction 2.3 on x, similarly 0 = y V0 + y; and if x =y V x # y, then 
Sx = Sy V Sx # Sy since Sx = Sy@x=y. 

(ii) By double induction. 

(iii), Gv) Double induction with respect to x and y. 

(v) From (iii) and (iv). 

(vi) In one direction we have to show x + y # 0 > Sy + x = 0. This can 
be shown by double induction on x and y.0 + y #0 > Sy + 0 = 0 since 
O0-y#0 and Sy-—~0=0 are both false) x =x-~0#4#0>x-05 
S(prd(x)) = x (2.4(vi)), hence SO — x = SO — S(prd(x)) = 0 — prd(x) = 0 
(2.4(i)). Induction step: suppose x > y #0 > Sy+x=0 (induction 
hypothesis), Sx -— Sy # 0. Then Sx = Sy = S(prd(Sx + Sy)) = 
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S(prd(x ~ y)) = x + y (applying 2.4(vi), and 2.4(i) twice), hence by our 
first assumption Sy ~ x = 0, and therefore also SSy + Sx = 0 (again by 
2.4(1i)). 

In the other direction, observe that y < x > y < x, and also y # x; 
since x < y and y < x would imply y = x by (ii), it follows that y < x > 
ax<y. O 


2.9. PROPOSITION. For allA PRA + AV —A. 
ProoF. Formula induction with 2.8(i) as basis (exercise). O 


REMARK. This proposition shows that it does not matter whether we adopt 
CPC or IPC as our logical basis for PRA; PRA is a fragment both of 
intuitionistic first-order arithmetic HA (defined in section 3) and of classical 
first-order arithmetic PA (= HA‘). 


2.10. PROPOSITION. 
Gi) x+ S0= Sx; 
Gi) SO+x=Sx, 0+x=x; 
(iil) x +(yt+z)=(xt+y)+2z; 
(iv) xt+y=yptx. 


PROOF. (i) x + SO = S(x + 0) = Sx. 

(ii) Induction on x. 

(iii) Induction on z. Basis step: x +(y+O0)=xt+y=(x+y)+0. 
Now assume x+(y+z)=(x+y)+z. Then x+(y+Sz)=x+ 
S(y +z) = S(x + (y +z) = Sx ty) tz) =(xt+ y) + Sz. 

(iv) Induction on y. Basis step: x +0 =x =0+ x. Assume x + y = 
ytx. Then x+ Sy=S(x+y)=S(y+x)=yt Sx =y t+ (SO+ x) 
=(y+s0)+x=Syt+x. O 


2.11. PROPOSITION. 

Gj) x-~y=Szex=y+t Sz; 
(li) x<yAy<z>x <2z; 
(li) xS yAysz7x<z. 


Proor. (i) By double induction w.r.t. x and y. Basis step: x — 0 = Sze 
x=Szex=04+ Sz and 0-y=Sze0=y+ Sz =S(y +z) since 
both sides of this equivalence are simultaneously false. For the induction 
step, assume x > y=Szex=y+Sz. Sx —Sy=Szex-y=S8ze 
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x=yt Sz; x =yt Sz > Sy + Sz = Sz + Sy = S(Sz + y) = SCy + 
Sz) = Sx; Sx = Sy + Sz — Sx = Sz + Sy = S(Sz + y) = S(y + Sz) > 
x =y + Sz. Hence Sx — Sy = Sz @ Sx = Sy + Sz. 

(ii) Assume x < y, y <z, then =y < x, =z < y, so by 2.4(v) p> x= 
S prd(y — x), z— y = S prd(z — y). Let u = prd(y -— x), v = prd(z — y), 
then y= x + Su, z=y + Sv by (i), so z = (x + Su) + Su=x+ (Sut 
Sv) = x + S(Su + v), hence x < z. 

(iii) Use (ii) and x<yoex<yVx=y,yszey<zVy=z. O 


The preceding propositions may suffice as an illustration of a quantifier- 
free development; for more examples we refer the reader to the exercises 
and the literature (see the references in 10.2). 


3. Intuitionistic first-order arithmetic HA 


3.1. Description of HA. Intuitionistic first-order arithmetic HA, also called 
Heyting arithmetic, is based on IQC with equality, and contains a constant 
0 (zero), a unary function constant S (successor) and function symbols for 
all primitive recursive functions. As axioms we have, besides the equality 
axioms for this language, 


-—S0=0, 
defining equations for all primitive recursive functions, such as 
x+0=x, x+Sy=S(x+y), 
x-0=0, x-Sy=x-ytx, 
and the induction rule 
A(0), A(x) > A(Sx) = A(t) 
which implies the induction-axiom schema 
B(0) A Wx( B(x) > B(Sx)) > Wx B(x). 


In a natural, deduction formulation we can adopt the same rules as 
indicated at the end of subsection 2.1. 
As in the case of PRA we have 


3.2. PROPOSITION. In HA 1 is definable as 0 = SO. 
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PRrooF. Straightforward extension of the proof for PRA. O 
In addition, it is now also possible to eliminate disjunction. 


ProposiTIOn. Jn HA A V B is explicitly definable as 


ax[(x =0 >A) A(x #0>8B)]. 


ProorF. Itis not sufficient to show thatin HA(A V B) © 4x[(x =0 > A) A 
(x #0 — B)]; we must actually show that for the defined disjunction the 
rules and/or axioms become derivable from the rules and axioms for the 
other logical operators, together with induction. Let us verify this 
for the natural deduction formulation. The V I-rules become derived rules 
by the following deductions 


(1) 0#0 0=0 S040 S0=0 (1) 
a ae Tee 

4 j= = 7 — a) a 

0-054 0#0>B SO=034 S0O#0>B 

(0=0—> A)AO#0—>B) (S0=0—>4) A (S0#0) > B) 

x(x = 0 >A) A(x #0 BD) Ax[(x=074) A (x#0> 8) 


As to the V E-rule, assume derivations 


[4] [8] 
Q 
C 6 


to be given. Then the following proof tree establishes V E for the defined 
Vv: 


(the double line stands for several deduction steps) (1) x =0 x = Sy (2) 
0 = Sy 
(3) (x =0 >A) A(x #0 > B) (3) (x =0 + A) A (x # 0 B) fe 
x=0>A4 x=0(4) x#0-8B x#0 
[A] [B] 
gQ’ 
4) (2) 
x=0-C ( x=Sy>c 
Se ees Ge a te ee (induction) xm x 
dx[(x = 0 — A) A(x #0 B)] Cc 


c (3) Oo 
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3.3. The further development of arithmetic itself can now be given along 
routine lines (cf. e.g. Kleene 1952, chapter 8). In the presence of quantifiers, 
many proofs from PRA can be shortened considerably. As an example, let 
us give another proof of proposition 2.5. 

Assume HA + A(0, y), HAF A(x, p(x, y)) > A(Sx, y). Then HA + 
VyA(O, y), HAF Vy A(x, y) > A(Sx, y), hence also HA + VyA(x, y) > 
Vy A(Sx, y); now by ordinary induction applied to B(x) == Vy A(x, y) we 
find | Vx B(x), so Vx Wy A(x, y). O 

It should be noted that here, in order to obtain the rule for quantifier-free 
A, we use induction over the universal formula Vy A(x, y). 

Another example is provided by Vxy(x = y V x # y), which has a con- 
siderably shorter proof in HA (E3.3.1). 


3.4. The relationship between PA = HA® and HA. The theorems on the 
relation between IQC and CQC in 2.3.1-8 can be straightforwardly ex- 
tended to HA and PA respectively: 


PROPOSITION. Let ® be the Gédel-Gentzen translation as defined in 2.3.4, 
except that for prime formulas we now put (t = s)®& = t = s. Then 

(i) PAF A= HAE A®, PAL A @& AB, 

(ii) For A not containing V or 3: HAE A iff PAF A. 


Proor. The modification in the definition of 8 is justified by the fact that in 
HAF —-(t =s) >? t=s. 

(i) follows from the corresponding theorem for logic by the observation 
that for any mathematical axiom A of PA, A® holds in HA, and (ii) is 
immediate from the definition of ®& O 


The second part of the proposition explains why so many theorems 
provable in PA can be lifted without any difficulty to HA: many well-known 
and important theorems from elementary number theory are expressible in 
the 4-free fragment; in fact, they are often purely universal (example: the 
quadratic reciprocity law). The proposition does not cover theorems of the 
form Vx dy A(x, y), A quantifier-free (i.e. theorem in [T$-form). However, 
it can in fact be shown that also 


PA + 4x A(x, y) = HAF 4xA(x, y) (1) 


for primitive recursive A; a proof is given in the next subsection and 
another proof is given in 5.3. The rule (1) is directly connected with the rule 
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MR pg, Markov’s rule for primitive recursive predicates: 
MRpp HA —3x A(x, y) = HAF 3x A(x, y), 

A primitive recursive. 


For HA + —~4x A(x, y) @ AVxA(x, y), and AVxA(x, y) = 
(Ax A(x, y))8, so by (i) of the proposition (1) implies MR pp and vice versa. 
The following is based on the digression 2.3.9-26, and may be skipped. 


3.5. PROPOSITION (corollary to 2.3.26). 

(i) Let A be essentially isolating or \-wiping. Then PAt A = HAE A. 
(ii) PA t Vx dy A(x, y) = HAF Vx Sy A(x, y) for quantifier-free A. 
(iii) HA is closed under MRgp. 


ProorF. (i) The arithnietical axioms of PA are M-closed under 8, since they 
are given by spreading formulas and formula schemas; 1 occurs only 
positively in I’, namely in SO=O>1. 

(ii) In HAF A(x, y) @ p(x, y) = 0 for some primitive recursive func- 
tion symbol ; 4y(p(x, y) = 0) is isolating, hence PAF JyA(x, y) @ 
HA + JyA(x, y). 

(iii) follows by the remark above. O 


As an example of a principle of PA not valid in HA we consider 


3.6. The least number principle. In classical arithmetic one can show that N 
is well-ordered with respect to arithmetic predicates, this is expressed by the 
least number principle 


LNP 3x B(x) > 4x[B(x) A Vy < x3B(y)] 
which is classically equivalent to transfinite induction over N: 


TI Vy[Vx < yA(x) > A(y)] > Vx A(x). 


w 


To see this, apply TI,, to , B(x) and take the contraposition; the converse 
is also easy. Intuitionistically, LNP implies PEM for HA: apply LNP to 


A(x) = (x =OAP)V(x=1A4P)Vx=2. 


3x A(x) is obvious; assume there is a least y such that A(y), then 
y € {0,1,2}. y=2 would imply —A(0),4A(1), hence —(0 = 0A P), 
3A(1 =1A—-P), ie. =P and —4P which is contradictory. If y = 0, it 
follows that P; if y = 1, it follows that —P. 
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On the other hand we have the 


PROPOSITION. =4LNP holds in HA, that is to say 
HA + ——[3x A(x) > 3x(A(x) A Vy < x5 A(y))]. 


First proof. We first give a direct proof. Since in intuitionistic logic —— 
(P > 92) 8 (HP > 0) 9 =-(P > Q) it is sufficient to show 

3x A(x) — —3x( A(x) A Vy < x4(y)) (1) 
which is equivalent to 

Yx[A(x) > -432(A(z) A Vy < zA4(y))]. (2) 


We show this by applying TI,,, which can easily be shown to follow from 
the induction principle in HA. Assume 


Vu < x[A(u) > 4532(A(z) A Vy < z5A(y))]. (3) 
We have to show 

A(x) > 4432(A(z) A Vy < z5A(y)) 
so asume 

A(x). (4) 


If 43y <xA(y), then with (3) ~32(A(z) A Vy < z4A(y)), and if 
ady < xAy, ie. Vy < xA(y), we have A(x) A Vy < x4A(y), 80 —H3x 
(A(x) A Vy < xA(y)). Therefore we have from (3) and (4) 


dy <x A(y) V Wy < xn A(y) > -3z(A(z) A Vy < z5A(y)), 


and since from P V ~P > ——@ it follows that —(P v =P) > ——Q, 
and hence —4Q, we obtain from (3) 


A(x) > —4x(A(x) A Wy < x5 A(y)). 
So (2) holds. O 


Another argument for ——LNP is given below; this depends on the work 
in section 2,3, 


Second proof (a metamathematical argument for .LNP). We use proposi- 
tion 2.3.26(ii). —.LNP corresponds to the schema 


aA(4x P(x) > ax( P(x) A Wy(y < x > 4P(y))). 
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The only positive occurrence of V is in Vy(y < x ~ —P(y)), and this does 
not contain a strictly positive occurrence of a d-formula; so the conditions 
for the Mints—Orevkov theorem apply. The arithmetical axioms are M- 
closed under ® and contain t only positively, hence from PA(P) + 
=ALNP it follows that HA(P) + —.LNP, and thus HA t+ ——LNP. Here 
HA(P), PA(P) are formulated as PA, HA but with a unary predicate letter 
P added to the language. O 


4. Algorithms 


4.1. The notion of an algorithm. One possibility for a sharper delimitation 
of the mathematical objects (= constructions) occurring in constructive 
mathematics is to require all objects to be representable as algorithms; that 
is to say each object is represented by an algorithm operating on finite 
configurations in a completely deterministic way. In particular, we may 
think of functions from N to N as given by algorithms. And since the 
presentation of (description of, programme for) an algorithm is itself 
finitely presentable, it also makes sense to have algorithms operate on 
algorithms, etc. 

Since we may assume finite (finitely representable) configurations to be 
coded in some way by natural numbers, there is no loss of generality if we 
restrict our attention to algorithms applicable to natural numbers, and 
which produce natural numbers as output. 

We shall not assume algorithms to be total on the natural numbers; for 
certain numbers the computation (i.e. the process of applying the algorithm 
to an argument) may break off without producing a result, or keep on 
running without terminating. The primitive recursive functions PRIM, 
which we encountered in section 1 are a very important class of total 
algorithms on N, but by no means exhaustive. For example, note that one 
can effectively enumerate all unary functions in PRIM as fo, fi, fa,--. Say; 
then f given by f(x) :=f,(x) +1 is algorithmic and total but not in 
PRIM. 

A more general class of total algorithms on N is given by the following 


DEFINITION. The class TREC of total recursive functions is the least class of 
functions satisfying the closure conditions of PRIM (with TREC replacing 
PRIM in the clauses) and in addition: 


(iv) If g is in TREC and Vy 4x(g(x, y) = 0), then so is f given by 


f(¥) = min, [ g(x, 7) = 0] (closure under minimalization). 
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Here min ,[ g(x, 7) = 0] is the least x for which g(x, 7) = 0 if such an x 
exists, undefined otherwise; min, is called the minimum operator. O 


Below we use = as in logic with existence predicate: t = ¢’ means that r 
is defined iff ¢’ is defined, and if they are defined, then their values are 
equal. 

We also introduce the class REC of partial recursive functions by the 


DEFINITION. The class of ( partial) recursive functions is defined by 

(ij) TREC c REC. 

(ii) REC is closed under composition, that is to say if g),..., g,, & REC, 
h © REC then also f defined by f(x) = h(g,(X),..., 2,,(X)) belongs 
to REC. Here the right hand side is defined iff g,(x),..., g,,(X) are all 
defined and have values y,,..., y,, Say, and h(y,,..-, y,,) is defined 
(the functions are strict in the sense of E-logic and E*-logic). 

(iii) if g € TREC then f, defined by f(y) = min,[g(x, ¥) = 0], is in REC. 

O 


One easily sees that the functions in REC are algorithmically comput- 
able. On the other hand, the definitions of REC and TREC are concise and 
easy to manage, but do not give a very clear idea of the extent of the notion 
of recursive function. It turns out however that there are many equivalent 
formulations describing the same class of functions: functions computable 
by Turing machines or register machines, A-definable functions, functions 
definable by the Gddel—Herbrand equation calculus, Post’s normal systems, 
Markov algorithms (cf. 1.4.6); and analysis of the informal concept of an 
algorithm (mechanically computable function, or function computable by a 
human operator in a mechanical way), has provided strong grounds for 
identifying these informal concepts with the mathematically precise notions 
just mentioned. It would carry us too far to present here the details of such 
a concept analysis; see e.g. Kleene (1952, chapter 13). 

Later on we shall not be overly precise in our use of “recursive”. When 
we want to stress that f © REC is total we call f a total recursive function, 
and we use “partial recursive function” to keep on the safe side, or when the 
reader deserves a warning. 

Intuitively, we think of an algorithm as an effective procedure that 
operates in discrete steps on a finite configuration of some kind, which is 
kept in a “working space” (a tape, a register, a piece of paper for a human 
agent). The algorithm is given by a finite set of instructions, which we call a 
programme. 
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The following features are inherent to the notion of an algorithm: 

(1) A programme (for an algorithm) is described by means of finitely many 
symbols in some fixed alphabet. 

(2) The execution of the programme is deterministic: each step uniquely 
determines the next step in the computation. 

(3) At each step only finitely many symbols are stored in the memory, and 
so the next stage of a computation is always determined from a finite 
set of data. 

(4) There is no upperbound to the size of the memory or the length of 
strings of symbols handled. 


4.2. On the basis of (1)-(4) above the following assumptions seem to be 
justified. 

(A) All f © REC are given by algorithms. 

(B) Programmes and computations are coded by natural numbers. 

(C) Let us call a predicate or relation algorithmic iff its characteristic 
function is algorithmic. Then “z codes a (terminating) computation accord- 
ing to programme x for arguments y” is an algorithmic predicate of x, ¥ 
and z. Let us denote this predicate by T (“Kleene’s T-predicate”’). There is 
a total algorithm U (the result-extracting function) which extracts the result 
from the code for a terminating computation. The instruction “Apply the 
programme with code z to X” is itself algorithmic in z and x. 

Thus each algorithmic @ has a code, x say, and is representable as 


7(¥) = U(min, [x7(x, ys; z) = 0]), 


or min,7(x, y, z) for short, where x, is the characteristic function of the 
predicate T. 

(D) Let {x }(),,..., y,) be the (partially defined) expression denoting the 
result (if existing) of applying programme x to arguments y,,..., y,- The 
(in general partial) function defined by the algorithm with code x is usually 
indicated by means of the so-called “Kleene brackets”: {x}; sometimes, if 
we want to indicate the number of arguments, we write {x}". 

If we consider {x}(y, Z), this represents for each fixed y an algorithm 
applied to z. The code of this algorithm depends on x and y; the code may 
be assumed to be found by an algorithm from x and jy, ie. there is a total 
algorithm @ such that {x}(¥, Z) = {@(x, ¥)}(Z)(the smn-theorem). 

(E) Any term constructed by means of { }(_), variables and constant 
numbers is an algorithmic function of its free variables (the value of such a 
term is regarded as defined if and only if all its subterms are defined). 
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4.3. For the mathematical consequences of “everything is algorithmic”, the 
properties (A)-(E) of algorithms listed in 4.2 are sufficient. For meta- 
mathematical work, one wants to know more; and the detailed analyses of 
the notion of an algorithm mentioned in 4.1, which motivate the identifica- 
tion of REC with the class of algorithmically computable functions, show 
that T and U in 4.2(C) and in 4.2(D) can in fact be chosen in PRIM. 


4.4, DEFINITION. A predicate S is said to be recursively enumerable (ab- 
breviation: r.e.) if, for some recursive R, S(y) := 4x R(x, y). 

If S is re. then S is the domain of a recursive function, namely 
min, [xX z(x, ¥) = 0] as a function of y, and conversely if S = dom(/) and 
f has code x, then S(y) Jz T(x, y, z). Thus we may also define: S is re. 
iff S is the domain of a recursive function. O 


PROPOSITION. X = {x: dy Txxy} is r.e., but not recursive. 


PRoor. Suppose X to be recursive, then N\ X is recursive, hence 
for some x9 N\ X = {x:4y T(x, x, y)}. Then x, EN \ XO 
Ay T(X9, Xo, Y) Xo © X, which is contradictory. O 


4.5. DEFINITION. Two sets X,Y CN, XQ Y= @ are said to be recur- 
sively inseparable if there is no recursive set Z such that X C Z, YCN\Z. 


PROPOSITION. The r.e. sets 
A= {x:4y(Txxy A Uy=0)}, B= {x:4y(Txxy A Uy = 1)} 


are recursively inseparable. 


PRooF. Suppose B C C, ACN\C, C recursive; let z be a code for x¢, 
then 


x€A>xEC © Fy(Tzxy A Uy =1), 
x€B>xeECe Iy(Tzxy A Uy = 0). 


Now z € C © 4y(Tzzy A Uy = 1) 22 © B, hence z € C; similarly z E C 
— z € C. Thus we have obtained a contradiction. O 


4.6. PROPOSITION (recursion theorem). 


Wx dy W2({x}(y, Z) = {y}(2)); 
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more generally, for every x there exists a (primitive) recursive , such that 
wyE({x}"*""(@(y), FZ) = (4(9)} (2). 
Proor. Let w be such that (smn-theorem, cf. 4.2(D)) 


(o(u, y', Y)}(2) = (u}O, ¥, 2). 


Let v be a code for the partial recursive function AuyZ.{ x }(¥(u, u, 9), ¥, Z) 
and take 9(y) = W(v, v, y), then 


{9(¥)}(Z) = {0} 0, ¥, 2) = {x} (¥(0, 2, 9), ¥, 2) 
= {x}(0(9), 7.2). 0 
An application is given in the following 


EXxaMPLE. The recursive functions are closed under a more general form of 
minimalization: if (x, y) is partial recursive then so is 


W/(¥) = min, [y(x, ¥) = 0]. 


Here ’(y) is defined and equal to x iff ~(x, 7) = 0 and for all x’ < x 
(x, ¥) is defined and unequal to 0. To see this we argue as follows. Let y 
have a code z, and let » be a total recursive function such that {pz }(x, y) 
= {z}(x + 1, y) for all x and y. Let {v} be a partial recursive function 
such that 


0 if {z}(0, ¥) = 0, 


{v}(u,z, y) = Pies y) +1 if {z}(0, y) defined and + 0. 


The definition as it stands does not immediately show that {v} © REC. To 
see this, write 


{v}(u, z, ¥) = 
se({z}(0, ¥)) - [U( Amin, [(7(z,0, ¥, jw) A Ujw = 0) 
V(T(z,0, ¥, Aw) A Uw > 0A T(u, 92, ¥, jw))]) +1]. 


By the recursion theorem there is a vg such that {vy }(z, ¥) = 0 if {z}(0, y) 
= 0, {v9 }(z, ¥) = {U9 }(z, ¥) + 1 if (z}, ¥) > 0. It is not hard to see 
that {v9 }(z, y) = min, [{z}(x, y) = 0]. O 
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In a similar way we can show closure under a general form of primitive 
recursion: given W, x © REC there is a pg € REC such that 
ee ¥)=0(y), 
p(x + is y) = x(@(x, y), Fe): 
4.7. The following type of definition is legitimate and can be used for 
introducing a partial recursive function: 
f(X) if 3yR(Xy), 
A(x) if Ay R(x, y), 
v(x) = ars 
f(x) if IyR,(%, y), 
undefined otherwise, 


where R,,...,R,, are recursive, f,,...,f, partial recursive, and the 
JyR,(x, y) mutually exclusive. To spell it out: suppose fo,..., f, to have 
codes é€g,...,e,, then 


v(X) = Uj, (min, [Fi < n(T(e,, ¥, jy) A R(X, by))])- 


4.8. Let us call an enumeration (f,), of unary algorithmic functions 
effective if f,(m) is algorithmic in n and m. The following proposition 
shows that the particular coding chosen for algorithms is irrelevant, as long 
as the codings are effective. 


PROPOSITION. Let ¢f,,)n,(Sn)» be two effective enumerations of algorithms, 
then there is an algorithmic permutation 6 of N such that fy, = g,, (i.e. 8 is a 
bijection from N to N). 


ProoF. See e.g. Richman (1983). O 


5. Some metamathematics of HA 


5.1. Markov’s rule. Below we shall present a simple proof (Friedman 1978 
and Dragalin 1980) of closure of HA under the rule MR pz (Markov’s rule 
for primitive recursive predicates): 


MRpg b dx Ax = 3xAx (A(x) € PRIM); 


another proof was given in 3.5(ii). In fact, more is true: HA is also closed 
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under the rule 
MR Fb Wx(Ax V=Ax) and + -A4dxAx = + AX AX 


(for a proof see E4.4.8). 


5.2. DEFINITION. Let A be any fixed formula of HA. For arbitrary for- 

mulae B we define B4 by 

(i) P4:= PVA for P prime, 14 = A, 

(ii) (BeC)4 = B40C4 fore E{A,V, >}, 

(iii) [((Qx)B]4 == (Qx)B4 for Q © {V,3}, with the proviso that where x 
occurs free in A, we first replace all free occurrences of x in B by a 
variable not occurring free in A. O 


5.3. LemMa. Let [4 = {B4: BET}, and let + be derivability in 1QC. 
Then 

(i) AK B4, THE B>I4+ BA; 

(i) HAL B= HAE BY: 

(iii) for any term t 


HA + +—4x(t(x, y) = 0) = HAF 3x(t(x, y) = 0). 


PRoor. (i) 4 + B% is proved by a straightforward formula induction. The 
second part uses induction on the height of deduction trees of P+ B. Let 
us consider the induction step where the final rule applied is 1 ;, so the 
derivation is of the form 


By the induction hypothesis and A + B4 we have deductions 


[r] [A] 
g’ Q" 
A Be 
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and thus 


All other steps are equally trivial. 

(ii) Since by (i) the 4-translation preserves derivability it suffices to show 
that HA proves the 4-translation of its axioms. E.g. (Sx =0—> 1)4= 
(Sx = 0 V A) > A which holds in HA. 

(iii) Apply (ii) with A := 4x(t(x, y) = 0). O 


From this lemma we immediately get 


5.4. COROLLARY. HA is closed under MRpg. OU 


Another application of the translation 4 is the following. 


5.5. THEOREM. HA is closed under the “independence of premiss” rule 
IPR + A > 4xB=>+ 3x(—=A > B) (x not free in A). 


Proor. (Visser 1981, part 6, 3.1). Assume HA + 4A ~ 4x 8B. Then by 
5.3(ii) 


HA + (44)77%4 > ax(B)7™. (1) 
Now observe that by induction on D we can prove for any C, 

Dt=AC> (DoD), 
and thus also 

+ D© + (4C > D), (2) 
hence 

+ (B)~* = (44 > B) 
and so 

HA + ({A)7™* = 4x(4A > B). (3) 
On the other hand, (~A)774 = (A > 1)774 which is equivalent to 
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A774 — —\ A. If we apply (2) with D = 4A,we find + A774 > (44 > 
A); also, by propositional logic + (,A ~ A) > 44, so b A774 RO 
=A, hence + (4A)77~4 and therefore with (3) 


HAF 4x(,4 > 8B). O 


5.6. Disjunction and existence properties. All well-known intuitionistic for- 
malisms containing arithmetic have the following properties for sentences 


DP -F}AVB>F-A or FB 
(the disjunction property) and explicit definability for numbers 
EDN + 4xA(x) =F A(v) for some numeral 7. 


This agrees well with the informal reading of V and 3: a proof of A V B is 
given as a proof of A or a proof of B, i.e. from an intuitionistic argument 
for A V B we should be able to extract an intuitionistic argument either for 
A or for B. Now, if in an intuitionistic formalism S we prove St A V B, 
the formal proof is justified by an informal intuitionistic argument for 
A V B, from which we can then extract an intuitionistic (informal) argu- 
ment for either A or B. However, it is not self-evident that this informal 
argument can in turn be translated into a formal proof forS + A or SF B. 
This is a kind of closure property on the set of proofs represented by formal 
deductions in S, and it is certainly possible to construct systems which are 
intuitionistically obviously correct, though somewhat artificial, for which 
DP fails (see e.g. Troelstra 1973, 1.11.2). 

The same, mutatis mutandis, might be said with respect to EDN; in fact, 
there is a close connection between the validity of DP and EDN, as shown 
by Friedman (cf. 10.5). 

We shall now show how to prove DP and EDN (more in fact) with help 
of the (Aczel-) slash relation. 


5.7. DEFINITION. I is a set of sentences of HA, and deducibility + refers 
to deducibility in HA. 
For sentences A we define 


@) I|A = + A for A primeorA=1, 

(ii) TJA A B =T|A and |B, 

(iii) [|A V B = TA or TB, 

(iv) TPJA > B= T|A>T|Bandr+A-B, 

(v) [|WxAx == Ib Wx Ax and [An for all numerals 7, 
(vi) I'|3xAx «= TI|An for some numeral n. O 
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LemMA. [|A > Tb A. 
ProoF. By induction on A. O 


5.8. LEMMA. Let ¢,,..., ¢,, be closed terms, t,,...,, the numerals obtained 
by evaluating them. Then for closed A(t,,..., t,) 


T\A(t,,.--,t,) @ TIA(G,-.. t)- 


PRooF. First observe that + t = 1 for closed terms ¢. This is most easily 
proved as follows: assume all primitive recursive function symbols to be 
enumerated as @,,,-.. such that , is defined using only 9, for 
i <n. Establish by induction on n that 


b @( My, --+5 m,) = @, (171... m,) 


is provable for all m,,..., m,, then prove by induction on the complexity 
of ¢ that + t =7¢. (N.B. The proof actually shows PRA + ¢ = 7.) From 
HA ¢;= 1, and the equality axioms/rules we get + A(t,.-.,¢,) @ 
PE A(h,..., t,)- 

The lemma itself is now proved by induction on A. For example, let 
A=B-C, and write A’, B’,C’,... for the result of replacing in 
A, B,C,... the terms ¢,,...,¢, by #,...,¢,; then by the induction hy- 
pothesis, 


TIB>Ce(rB=T|C) and TrHBOC 
e ((T|B’ = T|C’) and T+ B’>C’) 
«> TB’ > C’. 


Or if A = Vx B(x), we have ['|Wx B(x) @ (I+ Vx B(x) and for all n 
I'|B(”)), which is equivalent to [+ Wx B’(x) and for all n I'|B’(n), and 
thus to P|Wx B(x). O 


5.9. THEOREM. If I’|B for all B € I, then 
PRAT. 


PRrooF. Notationally the simplest solution is a proof by induction on the 
length of a proof in a Hilbert-type axiomatization of HA (2.4.1, 3.1). 
We shall check some cases and leave the others to the reader. 
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For formulas F with FV(F) C {x,...,x,} we define I|F := 
[+t Wx,...x,F. So in order to prove the proposition we only have to 


prove '|F(m,,...,m,,) for all choices of numerals m,,..., m 


Case 1. If A © I, then + A by assumption. 


We first consider some logical axioms. It is no restriction to consider only 
closed instances of the axioms. 


Case 2. AN B-A. By definition [JA A B > T|A, and by logic Db 
AA B-A trivially, hence [|A A B= A. 


Case 3. A > (B—A A B). We have to show T+ A > (B >A A B)and 
r'\|A = I|B > (A A B). The first is obvious; as to the second, let I'|A 
and assume I"|B; then I'|A A B, and by the lemma of 5.7 also + A A B, 
and thus I'|B > A A B. Thus we have shown '|A = (T|B > A A B). 


Case 4. To show I'|(A > C) > [((B > C) > (A V B > C)j (1). Consider 
TA > C (2), T|B>C (3), 
T|A v B (4), T|AV B->C (5), 
T\(B>2C)>(AVB—C) (6), FIC (7). 
Call the corresponding statements with | replaced by + (1’)-(7’). We will 
repeatedly use the lemma of 5.7. Since (1’) holds, we have to show 
(2) = (6). So assume (2), then also (2’). In order to obtain (6), we have to 
show (6’)- which follows from (1’) and (2’)- and (3) = (5). Assume (3), 
hence (3’) holds. In order to obtain (5), we have to shown (5’)— which 
follows from (3’) and (6’) - and (4) = (7). Now, by definition (2), (3) and 
(4) yield (7); so on the assumption of (2) and (3) we get (5), which was what 
we required. 


Case 5. [|At > 4x Ax. Let 'jAt, t a closed term. Then there is a numeral 
n such that + t = n, hence (lemma 5.8) I'|An, so '|4x Ax. Then '|At > 
4x Ax. 

We now consider the rules 
Case 6. T|A > B, IjA > IB. By the first assumption T'|A = I'|B, hence 
IB. 
Case 7. T'|A(x) = T'|Wx A(x); x not free in I’. This holds by definition. 

From the arithmetical axioms, we consider only the induction schema. 
Case 8. Assume 

T|A(0), P|Wx( A(x) ~ A(Sx)), 
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then in particular, for all a 
T\An > A(Sn) 
and hence by induction for all n I'|An, therefore [|Wx A(x), etc. O 


5.10. THEOREM. HA has DP and EDN. 


Proor. Because of HAF A V B @ 4x[(x = 0 > A) A(x #0 B)], it 
suffices to consider EDN. Suppose HA + 3x Ax; then |4x Ax, so |An and 
therefore + An, forsome n. ODO 


REMARK. The fact that DP holds also shows that HA is properly contained 
in PA: let A be a Gédel-sentence (see e.g. Kleene 1952, section 42), ice. 
HA + A, HA K —4; clearly PAt A V —A, but not HAF A V —A since 
this would conflict with DP. 

In fact the soundness theorem for the slash yields more: 


5.11. ProposiTIon. If I'|C for all C € T, then HA + I has DP and EDN. 
Proor. As for 5.10. O 
In particular, we have the 
5.1.2. COROLLARY. For sentences C, if C|C then 
HA+ C > 3xAx > HA C-An_ forsome zn. 

Proor. The proof is trivial once the deduction theorem I, AF B= It 
A > B is available for the formalism of HA. For Hilbert-type systems the 
proof of the deduction theorem is implicit in the proof showing equivalence 


with natural deduction formulations (cf. 2.4.2) O 


It is interesting to note that the slash operator gives in fact an optimal 
criterion: 


5.13. PROPOSITION. Suppose C closed, and for all closed Ax Ax 


HA + C > 4x Ax => HAF C > An for somen EN, 
then for all closed D such that C + D also C|D; which implies C\C. 
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Proor. By induction on D. We again use the deduction theorem. 
Case (a). For D prime the conclusion is obvious. 


Case (b). D = Ax Ax, and C + 4x Ax, then by hypothesis + C — An for 
some n, i.e. C + An; then by induction hypothesis C|An, and thus C|4x Ax. 
We leave the other cases to the reader. O 


The preceding proposition also shows that the slash relation respects 
logical equivalence: 


5.14. CoROLLaRY. If C|C and | C@ C’, then C'|C’. O 


5.15. For the actual application of 5.12 it is convenient to have a syntactic 
criterion on C which is sufficient for C|C. We present one here. 


DEFINITION. The class 2#¥ of Rasiowa-Harrop formulas is defined as the 
class of those formulas which have no J or V as a s.p.p. (strictly positive 
part, cf. 2.3.23). O 


We have the following 


5.16. PROPOSITION. Let C be a sentence in RH, and 4xDx,AVB 

sentences. Then 

(i) if C € R# then C\C; and 

(ii) H C > 4x Dx = + C > Di for some numeral n. 
FC?PAVB>-CHAorktCneB. 


Proor. The proof of (i) follows, under the assumption C € 2#, from 
(a) if D is a s.p.p. of C, then C|D; 

(b) if A A B is ans.p.p. of C,C|A and C|B, then CjA A B; 

(c) if A > B is ans.p.p. of C and C{B then C|A > B; 

(d) if VxA is an s.p.p. of C and for all n C|A[x/n], then C|Vx.A; 

(e) if P is prime and P an s.p.p. of C, then C|P. 

Part (ii) readily follows from (i) with help of the deduction theorem. O 


5.17. COROLLARY. Since A € 2H for all A, we have in HA: 
(iii) KH 4A > 3x B(x) =H AA > BCH) for some numeral n, 
(iv) FAAP BVC>+-=A>Bor FAAP C. O 
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6. Elementary analysis and elementary inductive definitions 


6.1. In this section we discuss the definition of sets by means of inductive 
definitions, possibly with function parameters, and show how to replace 
inductive definitions by explicit definitions in arithmetic or elementary 
analysis EL, an extension of HA with variables and quantifiers for number- 
theoretic functions (i.e. functions in N — N). In the next section we shall 
use the theory of inductive definitions to describe, in outline, a smooth 
formalization of basic recursion theory. The results of this section are also 
used in section 9.7. 


6.2. Elementary analysis EL. We first describe elementary analysis, an 
extension of HA which is in fact conservative over HA: if we interpret the 
function variables of EL as ranging over total recursive functions, the 
axioms and rules of EL involving functions translate into theorems and 
derivable rules of HA; the details of this proof rely on the formalization 
of elementary recursion theory in HA, which is carried out in the next 
section. 

The language (EL) contains, in addition to the symbols of HA, unary 
function variables, denoted by metavariables a, 8, y, 6,..., the application 
operator Ap, the abstraction operator \ and the recursor r. As usual we write 
(z), or even pt, for Ap(q, ¢). 

Functors (function terms) and (numerical) terms are defined simulta- 
neously by 
(i) _ the clauses for terms in HA; 

(ii) function variables and unary function constants are functors; 
(iii) if p is a functor and ¢ a term, then @(f) is a term; 

(iv) if ¢ is a term, then Ax.¢ is a functor; 

(v) if ¢ and ¢’ are terms, and @ is a functor then r(t, 9, ¢’) is a term. 

The logic of EL is two-sorted intuitionistic predicate logic. As non-logical 
axioms we have the axioms of HA, with induction extended to formulas of 
(EL), 


CON (Ax.t)s = t[x/s] (A-conversion). 
axioms for the recursor (p unary, 9(x, y) = p(x, ¥)) 
r(t,p,0) =, 
r(t, 9, St’) = 9(r(t, 9,0), 2), 
and the quantifier-free axiom of choice 
QF-AC Wx 4yA(x, y) 7 JaVxA(x, a(x)) 
(a € FV(A), A quantifier-free). 


rec | 
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6.3. NOTATIONS in EL. Equality between functors is defined: 


p = b= x(x = Px). 


Coding of n-tuples, finite sequences etcetera, can be lifted from numbers to 
functions via A-abstraction: 


vP(ay,...,0,) = Ax.v?(a,x,...,4,x) 
JE(a) =Ax.jf(ax) forl<k<p, 
{Wiggs ia's My) TAKA Og). s24/ OX) 


By means of r we can introduce ax satisfying 


a0 =< ), 
a(x +1) = ax*ax), 
ie. a(x + 1) = (a0,..., ax) (take @x = r(( ), Ax jx *(a(j.x))), x)). 


6.4. Inductive definitions. Inductive definitions are used frequently in 
mathematics for defining a set of objects as the least set satisfying certain 
conditions. For example, the A-generated subgroup H of a group G, for 
some inhabited A C G, is the least subgroup H of G containing A; we 
think of H as obtained by using exclusively the following clauses 
(i) a€Ar>aed, 
(ii) abe H>a-b' EH, 
(here - is the group multiplication, ~! the inverse operation in G). In other 
words, we put nothing in H except what is obtained by repeatedly applying 
(i) and (ii): H is generated by (i) and (ii). 

Let us look at some arithmetical examples. 


Example I. The following clauses generate the graph P of the pairing 
function (n, m) > 273": 


Pl (0,0,1) € P, 

P2. (n,m,k) © P>(n+1,m,2k) € P, 
P30 (n,m,k) € P> (n,m+1,3k) €P, 
and P = {(n, m,2"3"):nENAmEN}. 
Example 2. 


Ql 1E€Q, 
Q2 neQAnmeQ-2"-3"EQ. 
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In this case an explicit definition is not so easy to give. Q contains 


1,2+3 = 6,2! = 36, 26-3, 25 + 36, 97°°3.3. 


The inductive clauses in these examples are closure conditions on the set 
X to be defined, of the form 


igeX AS AE Ve XACH LEX, (1) 


where C does not contain X, and f,..., ¢,-;,¢ and C may contain certain 
numerical parameters j. Slightly more generally we may consider clauses of 
the form 


Wi<t'(t,EX) AC PLEX. (2) 


Clauses of the form (2) or (1) can always be presented in a certain standard 
form where ¢ is replaced by a variable, namely as 


t=xACAIEXA:+:+ At, EXPXEX (3) 
or 

t=xACAWi<1t'(t;EX) 7 xEXx (4) 
(x not free in ¢, f,, t’ or C). We can get rid of the parameters by writing (4) 
as 

a(t =x ACAWI<t(t,E X)) 9x EX. (5) 


A finite number of clauses A,(X, x) > x © X may be combined into a 
single condition 


Ay(X,x) Vv i VA al Kok) eX (6) 

or with A = Ay V ++: VA,_} 
A(X,x) > x EX. (7) 
We can express the fact that the inductively generated set is the /east one 
satisfying the closure conditions, i.e. that elements are put in it solely on the 
basis of its closure conditions, simply by stating that any set satisfying the 
same closure conditions must contain the set generated by those closure 


conditions. Thus if P, is generated by a number of closure conditions 
B,(P,),---» By(Pa) say, we must have for any X 


B,(X) A --- AB(X) > (Py, CX); 
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or if we use the standardized version (7) 
VWx[A(X,x) > x EX] > [P, Cc X], (8) 


where P, is the set generated by A(X, x) > x © X. 

We can intuitively convince ourselves of the correctness of (8) without 
relying on the notion of the “least” set satisfying certain conditions as 
follows. Take e.g. example 2. The closure conditions may be written as 


1€Q, Inm(x=2"-3"AnEQAmMEQ) -xEQ. 


1 € Q holds “by an argument of depth 0”, i.e. 1 is given as being in Q 
immediately. An element of the form 2”- 3” is in Q by an “argument of 
depth less than y+ 1” if we have arguments of depth less than y for 
n€Q,me€éQ. Thus arguments for membership in Q take the form of 
finite tree structure, e.g. 


1€Q 1€Q 
1€Q ‘ Z Q 1€Q Q 
= ‘3 =6€E € 1¢é 
sel ‘ 
2-3°€Q 2-3=6EQ 
a a 
27%. 38 


For any predicate X satisfying the same closure conditions we can, 
starting at terminal nodes with “1 € X”, build up proofs of “t © X” for 
any node “rt € Q” appearing in such a tree, ie. Q C X. Note also that the 
collection of t € Q appearing at the roots of such trees representing 
arguments of finite depth indeed satisfies the closure conditions on Q. 

A similar justification of (8) also works for example 1 and more generally 
for any P, generated by clauses of type (4). The same idea will be used to 
obtain explicit arithmetical definitions for the P, later on. 

The standard form (7) for the closure conditions of our examples gives 
rise to monotone A, i.e. A satisfying 


A(X,x) AXCY>A(Y,x). 


In fact, any A obtained as disjunction of clauses J¥(t=x A CAWI< 0’ 
(t; © X)) is easily seen to be monotone. 
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Classically it is not difficult to show, for any A satisfying monotonicity, 
the existence of a least set P, such that 


A(P,,.x) 7x © Py 
Vx(A(X,x) > xEX)>(P,CX). °) 


To see this put 
P(X) = (x: A(X% x)} 
and define I’f for all ordinals a by 
Th = 6; 9) = 1 (Ug); Tp = UT a <A} 
for limit ordinals A. Then for some countable ordinal a ['g*! = If, and 
then If is the required P, (exercise). This justification relies on the 
classical notion of ordinal and is thus, in its generality, not available to us. 
However, in our example, and more generally for A constructed from 
clauses of type (4), as in (6), one has in fact already [?*+! =v, and 
x €I"*! corresponds to “x € P, by an argument of depth n”. For 
x € IF} =T,(@) means that x © P, immediately, and x € I? = I(T) 
means that x € P, can be shown by applying the closure conditions to 
certain x which are in I}, ie. which are in P, by “proofs of depth 0” 
etcetera. 
Next we shall generalize and formalize the ideas of the preceding 
informal discussion. We first define syntactically the class of closure condi- 
tions we are going to consider. 


6.5. DEFINITION. Let H be either HA or EL and let H(X) be the corre- 
sponding system with a letter X for a unary numerical predicate added. We 
define a class of formulas in H(X) inductively by 

(i) the formulas of H are in F; 

(ui) Xt © Y, where ¢ is a numerical term; 

(iii) if A, B © , then (A A B) and (A V B) EF; 

(iv) if A € FY, then (AxA) € F; 

(v) if A&P, taterm, x € FV(t), then(Vx <tA)EF. O 

It is readily seen that the formulas A € # are monotone in X. We shall 
now demonstrate the following 


6.6. THEOREM. Each A(X, z) of F is in H(X) equivalent to a formula of the 
form 

AxVy <t[x, z](P(x, y,z) V (Q(x, y,z) A Xt’[x, y, z])) 
for suitable terms t, t' and formulas P,Q = #(H). O 
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For the proof we need some lemmas. 
6.7. LEMMA. In H(X), for given to, t, and A, 
Wxq S tol y]Wx, < 4[xo, VJA(%o. 1) O 
vx < tL y]A(solx, »], [x,y], y) 
for suitable terms t, 5), 5, of & CH). 


Proor. The idea is to code pairs x», x, aS x =j(Xp, x), using suitably 
chosen s9, 5; to bound the possible values of j,x and /,x. Let 


v(y) = max{t,[xo, y]: x < tol y]}, 


tly] =s(tolv], ¥(y)). 
(This guarantees (Vx < tly]A( jx, (2x, ¥) > VXo S tol y]Vx, < tx, y] 
A(X, X,, y).) Now we choose so, s, such that for i € {0,1} 


sx, y] = ee if px <ALAx, y) AAx < toy], 
‘ 0 otherwise. 


So if x ranges over (x’ > x’ < t[y]}, then (so[x, y], s\[x, y]) ranges over 
pairs (j,x, jx) such that j,x < tly], fox < t[/,x, y]. Proving the re- 
quired equivalence is now routine. O 
6.8. LEMMA. In H(X) 
dayvz <t[x, y]A(x, yz) o axv2 < tL Ax, Ax]ACix Ax 2), 
Vx < tdyA(x, y) @ 3zVx < tA(x,(z),). 


ProoF. Straightforward. O 


6.9. Proof of theorem 6.6. We use induction on the complexity of A. 
Case 1. A(X, z) = B(z), X not in B. Then 
+ A(X, z) @ axVy < 0(Bz V (0=1A X0)). 


Case 2. A(X, z) = Xt’, x, y & FV(t’). Then 
Ft A(X,z) eo axVy < 0(0=1V (O=0A Xr’). 
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Case 3. Let 

A=BAC, 

Be 3xWyq < tolx, z](Py V (Qo A Xto)), 

Co axWy, < q(x’, z](Pi V (QA Xt) 


(induction hypothesis). For notational simplicity we drop the variable z. 
Contraction of quantifiers yields 


BANC 43xVyy S tol Ax]Vy, < oL Ax] 
[Pol Ax, Yo) V (Qo Ax, Yo) A XtoL Ax, Yol)] A 
[Pi(ax. rn) V (QilAx, 1) A XG LAx, 1]. 
Now we put 
P(x, y,u)= (Po( jx, y) Au =0) V (Py(Ax, y) Au #0), 
O(x, y, u)= (Qo Ax. vy) Au =0) V (OC Ax, y) Au #0), 
t{x,uj) = (1+u)-to[x] + sg(u)- [x], 
t'[x, y,u] = (1+ u)- tolx, y] + sg(u) - ql, yl]. 


Then AA Bo 3xVu <1Vy <?[x,ul(P V (QA Xt’) which is in turn 
provably equivalent to a formula of the class F. 


Case 4. A = B V C can be treated in a similar manner. 
Case 5. A = 1xB, A = Vx < tB are easy. We leave these to the reader. O 
6.10. We shall now show how the sets P, defined inductively by clauses 


A(X, x) © F can be defined explicitly in HA or EL. 
Let 


A(X, z)= 


AxVy < t[x, z](P(x, y, z) V (Q(x, ¥,2z) At [x,y,z] © X)). 
Clearly 


ze Tyo axvy < t[x, z] P(x, y,z), (1) 
and 
zé€yt}oaxvy <7[x,z](PV(QAt’ €I))). 


With the z © I} we can associate a tree of depth 1 with pairs (x’, 2’) as 
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labels at the nodes: 
(0, t’[x, 0, z]) (0, t’[x,1, z])... (0, t’[x, tfx, z]. z)) 


(x,2) 


At the bottom node we find z and the x realizing the existential 
quantifier in (1); there are t{x, z] + 1 successors. This is regarded as a 
correct tree for z © I! iff Vy < t[x, z]P(x, y, z): The labels at the top 
nodes are irrelevant in this case, only the number of top nodes counts; the 
labels assigned in the picture were chosen so as to ensure a uniform 
definition of “correct tree for z € '7*!” for n > 0. 

Suppose we already know what a (correct) tree for an element of If is, 
then a (correct) tree for z € '7*! has the form 


To T, i 
(xost 102) Gal. 2) Gat i ees: 


(x,2) 


(T, a subtree with label (x,, ¢’[x, y, z]) at its root) such that for all 
i < t[x, z] we have either P(x, y,z) holds and T, consists of the single 
node (y), or Q(x, y,z) holds and T, with label (x,, ¢’[x, y, z]) at the 
bottom node is a I'f-tree for t’[x, y, z]. There are ¢t[x, z] + 1 successors at 
the root. 

The trees corresponding to U{I'{: » & N} are all finite trees T of depth 
greater than 0 with finite label functions y, defined on their nodes such 
that 
(i) if n*(y) is a terminal node of T and p;n = (x, z), then P(x, y, z) 

and Or(n i {y)) = (0, t'[x, J, z)); 
(ii) if n*¢(y) © T and m,n = (x, z) then n*(i) © T iff i s ¢[x, 2]; 
(iii) if n*¢y) is not a terminal node of T and y;n = (x, z), then 

p(n *(y)) = (x,, t'[x, y, z]) for some x,, and Q(x, y, 2). 
(T, pr) is a correct (labeled) tree for z if p;({ )) = (x, z). It is now routine 
to formulate an arithmetical predicate F such that F(x, z) expresses “x 
codes a correct tree for z”. 

Now we can formulate our theorem. 


6.11. THEOREM. For each A(X, z) € F there is an arithmetically definable 
P, such that 

(+ W2(A(Py, 2) > PA(2)), 

(ii) + Wz(A(Q, z) > Q(z)) > V2(P,4(z) > Q(z), 

for all predicates Q. 
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Proor. For P, we take dx F(x, z) as indicated above. 
(i) Assume A(P,, z), i.e. , 


axWy < t[x, z](P(x, y,z) V (Q(x, yz) A AuF(u, t’[x, y, z])). 


Suppose we can take x9 for x. For each y < ¢t[x 9, z] we either know 
P(x 9, y, Z) or we know Q(Xxo, y, z) and F(u,, t’[X9, y, z]). We combine 
these data into a new tree T and a new label function 9, = » 


To T, pits Lea 
(> 


such that either (T,, p7,) coded by u, consists of a single node ( ) with 
Prk ) = (0, t'[x9, y, z]) = Cy) ani P(Xo, y, Z) holds, or (T,, pr) is the 
tree given by u, and Q(x, y, z); the label p((y)* 7) is given By the label 
of n in (7, er) i.e. p((y)*n) = pr(n). Finally, p¢ ) = (x, z). 

(ii) Suppose V2z(A(Q, z) > Qz), Le. 


axVy < t[x, y](P(x, y,z) Vv (Q(x, y,z) A Rt'[x, y, z]) > Rez. 
We can show Vxz( F(x, z) > Rz) by induction on x. Suppose 
Vy <xWz(F(u,z) > Rz), F(x, z). 


Then x codes a finite correct tree (TJ, q@); let — tree have nodes 
(0),..., (¢{x, 2]) of length 1, with subtrees 7,..., T;;,, 2}: 

If the subtree F, consists of a single node, « y) is terminal and P(x, ys z) 
holds where pee ) = (x, 2); and if (y) is not terminal, then J, is a 
subtree with code u< x, so Wz(F(u,z) — Rz); therefore dy) = 
(x,, Tx, ys z]) = 7K ), and Q(x, y,z); as a result Rr[x, y,z], by 
F(u, t'[x, y, z]) > Ri'[x, y, z], and thus 


Vy < t[x, 2](P(x, yz) V (Q(x, y, 2) A Ri'Lx, y, 2) 


and hence Az, etcetera. O 


7. Formalization of elementary recursion theory 


7.1, This section describes the formalization of elementary recursion theory 
in HA or EL. For reasons of technical convenience, we start with a set of 
initial functions and closure conditions somewhat different from the one 
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used in 4.1; the equivalence will be intuitively obvious, even though a 
formal equivalence proof is long and tedious. 

We shall treat partial recursive functionals in a single function argument; 
the treatment of partial recursive functions is obtained by dropping the 
function argument everywhere. Throughout this section we tacitly rely on 
2.7.6 (cf. 7.4 below). 


7.2. In the definition below and all following definitions, m is the code of a 
sequence (7m,..., M,_,), and i <k. 


DEFINITION. We inductively generate a set (2(a) of triples intended to 
represent the graph of partial recursive application (i.e. (n,x,m)E Qo 
{x}(mo,..., M,_,) = n)) by the following clauses: 

20 (n, (0, k,0, 1), m) € Q (constant function hx.n) 

Q1 (m,, (0, k,1, i), m) € Q ( projection) 

22 (m,; + 1, (0, k,2, i), m) © Q (successor) 

23 (a(m,), (0, k, 3, i), m) € Q (application) 

24 if no =n, then (n,,(0,k + 4,4),n*m)€Q and if ny #n, then 
(n,, (0, Kk + 4,4), n*m) © Q where n = (no, Ny, Ny, Nz) (definition by 
cases) 

QS if (n,,x;,m)EQ for i<j and (n, x;,(Mo,-.-,Mj-1)) © 2 then 
(ny, Ay is Crigys.ces X;))s m) € Q (substitution) 

Q6 if (n, x, m) € Q then (n, (2, k + 1), (x)*m) © Q (reflection) 

Q7 (Sb(x, y), (0, k + 2,5), (x, y)#m) © @ where, if x’ = (x), - 1, 
Sb(x; y) = <4, x", ¥({0,.x', 0, y),(0, ¥',.1,0), 30. .€0, 21, x" = TD) 
(smn-theorem). O 


REMARKS. (i) 26 simply gives an index to {x }(mo,..., m,_,) as a partial 
recursive operation in x, M9,..., M,_}- 
(ii) Sb(x, y) has been defined in such a way that 
{x}(y, mo,.--, My_1) = {Sb(x, y)} (m9, -.-. my-1) 
(cf. 7.5(4) below, and the clauses 20, 921, §25). 


7.3. LEMMA. (n, x,m) € QA (n’,x,m)EQ>n=n'. 


Proor. By induction on 2. The form in which we have given the definition 
of 9 corresponds to the examples as presented in 6.4. To bring it in 
standard form is routine. Q is generated by a closure condition A,( X, x) 
—> x © X with A,(X, x) of the form 


A,(X, x) V A,(X, x) V +++ VA7(X, x) 
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where A,(X, x) corresponds to clause 92i; for example 
A,(X, x) = 3k3i < k3m(x = (a(m),, (0, k,3, i), m)), 
A,(X, x) = Andyim((n, y,m) Ee XA 
x=(n,{2,Ith(m) + 1),{y)*m)), 


etcetera. O 


The lemma justifies considering 2 as the graph of a mapping, i.e. for m 
of length k we put 


7.4. DEFINITION. 
{x}(a,m) = In.((n,x,m) €@). 
So {x} represents a partial mapping from N“ x N¥ to N; x is called the 


index of that mapping. O 


7.5. PROPOSITION. 
(i) (smn-theorem) For each m,n with 0 < m <n there exists a primitive 
recursive S$" such that 


{Si"(25 Yorse+s Yat) (Os Yess In) = {X} (Gs Yor +s In)» 


(ii) (recursion theorem). There exists a primitive recursive function rc such 
that 


{re(x)}(a, wi) = {x}(a,r0(x), fi). 


(iii) The partial recursive functionals are closed under primitive recursion and 
minimalization. 


Proor. (i) For m = 1, S)(x, y) = Sb(x, y) satisfies (i) (cf. remark (ii) under 
7.2); for S"*1(x, Yo.--+> Ym) We can take Sb(S/"(x, Yo.---s Ym—1)> Yn)= 
(1) Standard from (i); cf. the argument in 4.6. 
(iti) From indices x, y, z we can construct an index u such that 


{x}(m) ifn = 0, 


(Nem aya tom)ea 1) itn #0 


(by an appeal to substitution, reflection, and definition by cases). By the 
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recursion theorem (ii), we find {re({u)}(n, m) = {u}(re(w), n, m), so 
{re(u)}(0, m) = {x}(m), 
{re(u)}(m +1, mi) = {y}({re(u)}(n, m), n, mi). 
We leave the closure under minimalization to the reader. O 
Our next aim is to obtain the 


7.6. THEOREM (normal form theorem). There exists a primitive recursive 
predicate T such that 


{x}(a, m) = (min T(x, a, m, u))o, 
and moreover 
T(x,a,m,u) A T(x,a,m,u’) > u=u', 


Thus, this T can be taken for Kleene’s T-predicate, and uu), for the 
result-extracting function U. 


Proor. We shall inductively define a predicate Comp,; Comp,(u) ex- 
presses that u is (a code of) a finite sequence where (u), is the output and 
(u), the index of the algorithm applied; the remaining components are the 
numerical input (arguments) and codes of subcomputations. a is present as 
a parameter. Mostly we drop the subscript a. The clauses for “Comp” 
correspond to the clauses for 92. 


CO <n, (0, k, n), m,0) € Comp; 

Cl (m,, (0, k,1, i), m,0) © Comp; 

C2 (m;41, (0, k,2, i), m,0) € Comp; 

C3 (a(m,), (0, k, 3, i), m,0) © Comp; 

C4 ny = ny > (ny, (0, k + 4,4), n*m,0) © Comp, 
Ny =n, > (Nz, (0, k + 4,4), n*m,0) © Comp, 
where n = (N19, 2, 22, 23)3 

C5 If there are uo,...,u,., © Comp and u, = ((u,)o, ”,;,m,(u,)3) for 
i<t,and u, = (v, n,, (Uo, --+, U,_1)s(Uy)3) then 
(v, (1, k, n,n), m, (uo, ..., U,_1)) © Comp; 

C6 If v € Comp, v = (u,n, m,(v)3) then 
{u, (2, k + 1), (n)* m, v) © Comp; 

C7 (Sb(x, y), (0, k + 2,5), (x, y)*m,0) € Comp. 


We can now put 


T(x,a,m,u) = Comp(u) A (u)2 =m A (u,) = x, 
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and prove by induction on 2 and Comp respectively 
{x}(a, Hi) =n > 3u(T(x, a, m,u) A (u)o = 7), 
T(x,a,m,u) > {x}(a,x) = (u)o. 


Note that Comp is a primitive recursive predicate (use basic facts about 
primitive recursive sets and the fact that coding of finite sequences is 
monotone, i.e. x < 2 *(x)*m for all x, n, m). We leave the details to the 
reader. Q 


7.7. THEOREM (numerical form of the normal form theorem). There exists a 
primitive recursive predicate T* such that 


{x}(a, m) = (min, T*(x, au, m, u))o, 
T*(x,n,m,u) > Ith(n) =u, 


T*(x,n,m,u)AT(x,n',m,uw) An <non =nAu=u'. 


Proor. From Comp we define 

Comp*(n, u) = Ja(Comp(u) A au =n). 
By induction on u one can show 

VaB(au = Bu > (Comp,(u)  Comp,(u)). 


Substitution of the primitive recursive f= Ax.(n), for a gives Comp,(u) 
<= Comp*(n, u) and therefore Comp* is clearly primitive recursive. For 7* 
we can take 


T*(x, n,m, u) = Comp*(n,u) A (u)) =xA(u),=m. O 


7.8. NOTATION. 
T(x, & Mm, u) = T(x, (a), (m), u), 
T*(x, , Mm, u) = T*(x, n',(m), u) A Wi < p(Ith(n,) = x), 


where n’ = p (Ax. (Fig Viscesh AX me it arbor) and # = (no,..., Mp-1)- 


We shall interpret {t}(ap,.. — 1 Mgys- 
{1}, (@,. . +9 @p- > ™Mo>-+ ), "Di 
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7.9. DEFINITION (continuous function application). 
a(B) = a(B(min,(a(Bz) + 0))) = 1, 
(alB)(x) = a(¢x)* B(min,[a((x)* Bz) # 0])) = 1. 


The operators -(-) and -| - introduced by this definition are partial; e.g. 
sE(Ax.0.(a@)). O 


We note the following corollary of our development of elementary 
recursion theory: 


7.10. PROPOSITION. There are numerals ny) and n, such that 
{ig }(a, B, x) = (aiB)(x), 
{7 }(a,B) = a(B). 


PROOF. Immediate from the definition and routine facts of recursion the- 
ory. O 


Let us now work in a definitional extension of EL with AaB.a(B) and 
AaBx.(a|B)(x) as primitive operations. We can then prove the following. 


7.11. LEMMA. 
(i) If [a] is a functor, and a has length p, then for some primitive recursive 


( F,l7,(@))(x) =@[a](x), and 


(ii) if t{@] is a numerical term of EL, then for some primitive recursive f, 


fv, (&)) = e[a]. 
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PRooF. (i) For notational simplicity let a = a. We first rewrite p[a](x) by 
eliminating the notations o|~, p() with the help of the preceding proposi- 
tion. The result is a term t{a, x] = p[a](x) with Kleene brackets. This can 
be written as {n}(a, Mm, x) for a suitable numeral 7, which is given by 


{7i}(a, m, x) = (min, T(H, a(z), (x, Mm), z))o. 
Now we need a functor f, such that 
(f,18 (x), = (min, T(a, &(z), (x, %), z))o, 
which can be achieved taking f, such that 
(z)ot+1 if 
f,({x)* 0) = Ju < v3z (Ith(u) =z A T(n, a(z), (x, m), z)), 
0 otherwise. 
f, is clearly primitive recursive and 
(gla)(u) = w © f,((x)* a(min,| f,((x)* az) > 0]) = +1, 


which, by the definition of f,, means that (f,|a)(x) = {x}(a, u, m) = 


pla](x). 
(ii) This case is similar but simpler, and left to the reader. O 


7.12. DEFINITION. 
al( Bo, ++ B,-1) = al», (Bos -- 5 B,-1) 
a(Bo,.--, By-1) = a(y,(Bo,--++B,-1))- Oo 


We can now build an analogue of ordinary recursion theory with con- 
tinuous function application replacing recursive function application. 


7.13. THEOREM (smn-theorem). 
(i) There is a primitive recursive binary functional A,, such that 


(a A, Bo) (Bi, ---> Br) = o1(Bos-- ++ By)- 


(ii) There is a primitive recursive binary functional N', such that 
(a An Bo) (By, er aS) B,) = a( Bo, meso, Bi); 


Proor. (i) As before, we write (al(By,..., 8,)\(x) as {i} ((a, B), x), and 
thus by the normal form theorem 


(ai(# ))(x) = (min,7(7, (a, B)z, x, 2). 


Sect. 7] Formatlization of elementary recursion theory 159 


Define a A By by 
(a@ A By)(0) = 0, 
(a A By)({x)*n)=y+1 


edz < Ith(n)(7(7, (a, Biteveaceh pony es x, z) A (z)o =»), 
(a A By)(<x)*n) = 0 otherwise, 


where f, = Az. j'*((n),) (for our standard coding j*(0) = 0). 
(ii) is proved similarly. O 


7.14. THEOREM (recursion theorem). 
(i) For each a there is a B such that 


a|(B, ¥) = B\(7). 


(ii) For each a there is a B such that 
a(B, 7) = B(7). 
PROOF, (i) There exists a specific e such that 
e|(5, 7) = a|(8 A, 5, 7); 


for B =e A,, € we find B\(¥) = (eA, ®)I(¥) = el(e, ¥) = alle A, & ¥) = 


a|(B, Y). 
(ii) Similarly. O 


7.15. NOTATION (A°x, Alx, A®a, Ala). 

Let ¢ be a term in Y(EL) extended with -(-),-| - , provably total for all 
values of x, then A°x.¢ is some term ¢’ primitive recursive in the parameters 
of ¢ minus x such that 


{t’}(x) =t. 


In other words, A°x.t is an index for ¢ as a partial recursive function of x; 
the index may be assumed to depend primitive recursively on the other 
parameters. The following definitions are to be understood similarly. 

If ¢ is a term, A°a.t is some functor g primitive recursive in the 
parameters of f minus a such that 


p(a) =t. 
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If @ is a functor, Alx.p is some functor ~ primitive recursive in the 
parameters different from x such that 


blry.x = @ 
and A’a.p is some functor W’ primitive recursive in the parameters of 
distinct from @ such that 


Y'la = @. 
The choice of the terms and functors for A®x.t, Alx.t, A°a. p, Ala.g is 
immaterial, but they can be defined canonically by induction on ¢ and » 


with help of the smn-theorem. 
In the sequel we shall often use the abbreviations 


Ax.t:= A°x.t, Aap = Aap. O 


8. Intuitionistic second-order logic and arithmetic 


In this and the next section we shall describe some formalisms for intuition- 
istic higher-order logic and arithmetic and prove some of their meta- 
mathematical properties. These sections principally serve as a reference for 
later developments, to be consulted when needed. 

The present section is devoted primarily to HAS, intuitionistic second- 
order arithmetic with full (impredicative) comprehension (this system will 
be considered in E4.4.11-12, E4.5.6 and section 5.7). 


8.1. Intuitionistic second-order logic I1QC?. A simple version of IQC? is 
obtained as follows. The language of IQC?, called Y? for short, is a 
many-sorted language obtained by adding to Y(IQC) variables for n-ary 
relations (n >0) X",Y",Z",..., and quantifiers for them. The super- 
scripts " will often be omitted. If we wish to distinguish between 
(first-order)-quantifiers over the domain of individuals, and (second-order)- 
quantifiers over relations, we use V', 3! and V7, 3? respectively. 

Relation terms: the only relation terms are variables. 

Atomic formulas: first order atomic formulas, and whenever ¢),..., 7, are 
first-order (individual) terms and X” is a variable for an n-place relation, 
X"(¢,..., 1”) is an atomic formula. 

N.B.: in this version a constant n-ary relation R in ¥(IQC) is not treated 
as a relation term. 

Axioms and rules. As our logical basis we can now take any standard 
formulation of many-sorted intuitionistic logic without equality, and in 
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addition the following axiom of full (impredicative) comprehension 
CAs! Xp 60 | Asc ng He VN he es 0) 


where A is any formula of the language not containing X” free. 
In this system equality between individuals can be defined by 


ty = ty = WX'(X(t)) & X(ty)). 


It is easy to see that the usual equality rules now become derivable. For 
example we have the following deduction of the replacement rule (where for 
convenience we drop the parentheses in X(7)) 


(1) Vx(A(x) © ¥x) 
WX'(Xt, @ Xt,) A(t) e Yu 


(1) Vx(A(x) @ Yx) Yt, @ Yt, A(t,)> Yt, A(t,) 
A(t,) @ Yt, Yt, > Yt, Yt, 
Yt, > A(t2) Yt, 
3X'Vx( A(x) © Xx) A(t) 3B(1) 
A(t) 


Moreover, if we define 
AMS YS xy on (XN ee) Se Ves te) 
it is easy to verify by induction on the formula complexity that the equality 


rules also hold in this case. 


8.2. Other formulations of 1QC*. The introduction of pairing operators in 
the domain of individuals is an inessential change in the language. Writing 
(t,, t2) for the pair formed from 4, t, we require 


(41,2) = (1,2) Om = A 2 = 2. (1) 
As soon as we have pairing operators we can also form n-tuples satisfying 
(Xp0000%,) = Oppess BY O SNA OO AK, HD 


We may now think of X"(x,,...,X,,) aS a unary predicate applied to the 
n-tuple (x,,...,X,,), that is to say we only need variables for unary 
relations or sets, and possibly for propositions (i.e. n = 0). However, if we 
wish, we can also define quantification VX°,3X° in terms of VX1,3X! as 
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follows: let c be any fixed individual constant in the language. Then 
WX°4 = WYl4[ X°/Y(c)], 
3X% = ay'4[X°/¥(c)]. 
Intuitively speaking, if Y! ranges over all sets, Y'(c) ranges over all 
propositions. As to comprehension, note that 
3Y'[A © Y'c] 
(x, Y not free in A) holds by applying CA to A A x = x, Le. 
3Y'vx[A Ax=xe Yly]. 


Note also that instead of requiring (1), we can also demand explicitly, in a 
definitional extension, the existence of po, p, decoding the pairing 


Pi(Xo, 1) = x, (i € {0,1}); 
and hence more generally we have p? such that 
BI(Xpree-o%ea) =a) (i <2). 


Finally, thinking now of IQC? as a two-sorted theory with individual and 
set variables, we can write t© X for X(t); © may be treated as a 
two-place relation constant for a relation between individuals and sets. 

Of greater interest is the introduction of a definitional extension by 
means of comprehension terms; for every formula A in the language we 
introduce a second-order term (relation term) Ax,...x,-A (also written as 
{(x,,...,X,,): A}, with an axiom schema 


CRM eee A aa ke) ist, Ae (2) 
In a natural deduction formulation, V7E and 37I now take the form 
WX" A(X") A(T") 
2 2 
WET") VTS aK’? 


where J” is any term for an n-place relation. CA is now absorbed by these 
rules: 


Vx... X,[(AX,--. %,-A)(X1,-..,%,) @ A] (axiom (2)). 
AX" Vx,...x,[X(X%,---.X,) @ A] 


In a natural deduction system it is of course more natural to replace (2) by 
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two rules Al and XE: 


ii Al bigenist 2) 
ORG tA eect) 


ie Ces eC eee 20 fC eee 
A(t,,...5¢,) ° 


A slight variant avoids the explicit introduction of abstraction terms 
altogether; V7E and 37I now take the form 


WX" A(X") AX 
A* 


270 
VT Sy AK)’ 


VE’ 


where A* is obtained from A(X") by replacing all occurrences of 
X"(t),...,¢,) by B(t,..., t,,) for a fixed formula B(x,,..., x,)- 


CONVENTION. Below we shall assume that our standard formulation of 
IQC? has = between individuals as a primitive and that abstraction terms 
are available, unless stated otherwise. O 


8.3. Definability of V, A,3, 1 . In 1QC? with comprehension terms we 
can define L,A,V,34,3? in terms of — ,V as follows (X not free in 
A, B): 


Be = WX X°); 

AA B=VX((A>(B>X)) > X); 

AV B=VWX°((A > X)A(B>X)>X); 
axA = WX%(Wx(4A > X) > X); 

3YA =VX°(VY(A > X) > X). 


For example, we can obtain V E as a derived rule as follows: 


(1) [A] (2) [BJ 
iy) gz 
C 
= OQ (2) 
(A V B) #VWX((A > X) A(B > X) > X) Ae BoC 


(A> C)A(B2C)>C (A > C)A(B>C) 
C 
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In the absence of comprehension terms we need 3? as a primitive (CA is 
formulated with 37). We leave the verification of the correctness of the 
other definitions to the reader. 


8.4. Intuitionistic second-order arithmetic HAS, also called second-order 
Heyting arithmetic. This system is like I1QC?, but the language is restricted 
to logical operations, equality between individuals, a single individual 
constant 0 (zero), a single unary function constant S (successor). There are 
axioms 


Vxy(Sx = Sy>x=y), 
Vx(0 # Sx), 


and induction expressed as an axiom 


VX1(0 € XA Wx(x EX > Sx EX) > Vy(y E X)). 


Thus we think of the individual variables as ranging over N. 

Alternatively, HAS can be described as an extension of HA, obtained by 
adding variables for sets of natural numbers (X, Y, Z), with the corre- 
sponding quantifier-rules and axioms and full comprehension CA for n = 1 
(either explicitly or by permitting comprehension terms); the equality rules 
are to hold for the full language. 

It is the second version that we shall treat as the standard one; from the 
results mentioned in 8.6 below we see that it is in fact a definitional 
extension of the first version. 

Instead of obtaining HAS from IQC? by “identifying the domain of 
individuals with N”’, as in the formulation above, we may also embed HAS 
in IQC? as follows. Define 


yEN=VX'0E XA Vx(x EX > Sx EX) > yEX), 


and interpret number variables as variables restricted to N, and set vari- 
ables as variables restricted to subsets of N, then it is easy to see that all 
axioms and rules of HAS are derivable in IQC? from 


0#S0, Va,meN(Sn = Sm>n=m). 


8.5. DEFINITION. A number structure (X, f,c) is a structure such that 


VY(cEYAWX(xEYOfeY)>XCY). 
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A Peano structure is a number structure satisfying in addition 

c#fe, Wx,yEX(fx=fp>x=y). O 
The following theorem is well-known from classical second-order logic: 
8.6. THEOREM. Jn HAS or 1QC? 


(i) If (X, f,c) is a Peano-structure and (X’, f’, c’) a number structure, 
there exists a unique homomorphic mapping p such that 


gc=c', (fx) =f'e(x) forallx © X. 


(ii) Any two Peano-structures are isomorphic. 

(iii) In any Peano-structure (X,c, f) there are uniquely determined functions 
satisfying the equations for the primitive recursive functions (identifying c 
with 0, f with S). In particular, there are uniquely determined + and 
such that for all x, y © X 


xteo=x, xt+(fy) =f(xt+y), 
x-c=ec, x (fy)=x-ytex. 


REMARK. The function » is in fact definable in the language as a binary 
relation Y,, of functional character, i.e. Vx © Xa!y © X(Y,(x, y)). 


PRooF. The classical proof works intuitionistically as well. See E3.8.3. O 


8.7. The negative translation for HAS. The extension of the negative trans- 
lation is completely straightforward. For simplicity, let us consider the 
second version of HAS without comprehension terms. We only have to add 
a clause in the atomic case: 


KU scctindy) = Sk essed): 

The only axiom which needs checking is comprehension 
AXVx,...x,[A @ X(x,...,x,)], 

which translates into 
—WX-ANX, ...x, [48 @ 4. X(%1,--.,x,) 15 

and this is equivalent to 


nd XVX,-. . x, | A® aad =A X(x,...; x,)} ‘ 
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Observing that —,A® <@ A® for all A, we see that this follows from 
comprehension applied to A® 


AXVx,...x,[A®& o X(x,,...,%,)]- 
Cf. also the E3.8.4. As a result we have 
PROPOSITION. HASS + A => HAS+ 48%. O 


8.8. A Kripke model for HAS. Let us now treat HAS as a two-sorted 
theory. Accordingly, we need to specify two domain functions D and 
D’ in a Kripke model for HAS. We consider a class of Kripke models 
= (K, <, Ik ,D,D’) for HAS where (K, <) is any tree structure (for 
example, the binary tree), D is constant, namely D(k) = N for all k € K, 
and D’ is also constant, namely D’(k) consists, for each k, of all families 
S= {S,.:k' & K} which satisfy 


kos ky - Si Si, 
S, CN for all k’ € K. 


Equality for individuals is interpreted by proper equality at each node, 
+,+, = are defined in the usual way on each D(x): 


(kik t, =t,) et =t. 
Also 
kitnE€ S:=ne S,, where f= (S,2k' € K}. 


8.9. PROPOSITION. A Kripke model of the class defined above is a model for 
HAS. 


Proor. The only schema for which the verification is not entirely trivial is 
CA, Suppose A(x) to have only x free and put 


S,:= {n: ki} A(n)}, 


then, for an arbitrary k’, f= {S,: k € K} is a family belonging to D’(k’) 
and for this Y we have 


kK’ tt Vx(xEf%eA(x)). O 
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8.10. It should be observed that for purely arithmetical sentences A we 
have classically 


k\t+A = Ais true 


and in particular, regarding the models from a classical point of view, 
k \+ A V —A. On the other hand, we do not have 


k Ik WXVx[ Xx V — Xx]. 


For example, let (K, <) be {0,1} with ordering 0 <1, and consider 
S = {S,S,}, with S,=8, S;=N. Then ki Vx[x EL Vix EF] 
meansk lk nE€ Lf Vin € F foreachn;0 K0 EY, but also0 K OE Y 
since 1+ 0 EF. 

We shall see later that this observation can be considerably generalized 
and strengthened. 


8.11. The system HAS). So far we have given no thought to the legitimacy 
of impredicative comprehension from the constructive point of view; in 
HAS we have been restrictive on the logical side, but permissive on the 
definitional side, that is to say we have regarded impredicatively defined 
sets as well-defined. It does not obviously follow from, say, Brouwer’s point 
of view that impredicatively defined sets are to be excluded, though in the 
traditional intuitionistic literature the full force of impredicative compre- 
hension is nowhere used; nor is this the case in the writings of the Bishop- 
or Markov-school of constructivism. 

The principal advantage of HAS over HA consists in the greater expres- 
sive power of the language, not so much in its proof-theoretic strength. We 
shall now describe a subsystem HAS, of HAS which is in fact a conserva- 
tive extension of HA. 

HAS, is defined completely similarly to HAS, except that now CA is 
restricted to formulas A not containing bound set variables (so-called 
“arithmetical comprehension”). Equivalently, the comprehension terms 
\x,...X,-A are restricted to A not containing bound set variables. Under 
these circumstances the induction axiom is equivalent to the arithmetical 
induction schema, restricted to formulas not containing bound set variables. 
It should be noted that in HAS, the definability of 1, V, A,4 in terms of 
VY, — no longer holds, nor can we define equality between individuals. 
Using classical metamathematics we give a simple proof of the following 


8.12. THEOREM. HAS, is conservative over HA. 
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PRoor. We rely on the (classically established) completeness theorem for 
Kripke models and we shall treat HAS, simply as a two-sorted first-order 
theory. Let #= (K, <, i+ ,D) be an arbitrary Kripke model for HA; we 
shall show how to extend % to a model #’ = (K, <, IH’,D,D’) for 
HAS,, IF’ an extension of iF . Without loss of generality we may interpret 
equality in 4% at each node k € K as real equality in D(k) (since = is 
decidable, cf. E2.6.4), and n-ary function symbols may be interpreted at k 
by real functions of D(k)” > D(k). 

For D'(k) we take as elements the collection of subsets Cyig a, 
D(k), with d,,..., d, © D(k), x the only variable free in A(d,,...,d 
A(X,,...,X,, x) € (HA), and such that in the model 


Chidyrorssdgcx) = (x? Aldiecyds, *)): 
We extend IF to IH’ by stipulating 


Kite Cua a yy = KI A(A,..., d,, ¢), 


for all closed terms with parameters from D(k); for other prime formulas 
IF’ coincides with IF . 

It is now easy to check that arithmetical induction and comprehension 
hold in 2’; we leave this to the reader. 

This establishes our theorem, for suppose A to be such that HAS, + A, 
but HA # A, then there should exist (by the completeness theorem) a 
model such that # A. On the other hand %’ models HAS», so 
X' i+’ A; but this contradicts the fact that IK’ extends I. O 


9. Higher-order logic and arithmetic 


Even more so than for the preceding section, it is true that the contents of 
this section are not needed until much later; so the reader can safely skip 
this section for the time being. It is primarily for systematic reasons that we 
have placed this section here. We mainly concern ourselves with types 
obtained by finite iteration of the power-set operation. The section will be 
built up as follows. First, we describe one of the simplest versions of 
intuitionistic finite-type theory and finite-type arithmetic, together with 
some basic and simple metamathematical results (9.1-4). After that we 
shall discuss a number of possible extensions (9.5—12) 
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9.1. The theories TYP and TYP + EXT. TYP is a theory based on many- 

sorted intuitionistic logic. The sorts, called types, are given by 

(i) there is a collection of basic sorts ( primitive types) Bp, By, Bs...-, 

(ii) if o,,...,6, are types, so is o, X --- Xo, (the product type of 
0),.--,6,) 

(iii) if o is a type, then so is [o] (the power type of o; alternative notations 
a > Q or P(o)) 

Intuitively, we think of the basic sorts as a collection of given domains; 
(ii) permits the formation of cartesian products of domains, (iii) the 
formation of power-domains for any given domain. 

We shall assume, for the time being, the basic sorts to represent inhabited 
domains. 

The language of TYP contains equality for each basic type, and a relation 
&, for each type o between elements of type o and elements of type [o}. 
For each type we have a countably infinite supply of variables; for the basic 
types there are individual constants, function symbols and relations avail- 
able. 

For the products we have n-tupling ( ) with inverses p? such that 


PI (Xo.--2>Xn-1) =X; fori<n; 
equality for power types and product types is defined by 
tel = fel Wx%(x Et ox Et), 


1f0% Xn == 90% Xr = A ( prt, = pity). 


i<n 


Finally, we have for each o a ( full or impredicative) comprehension axiom 
schema 


CA axllyy(yexeA(y)) (x €FV(A)) 


for any formula A in the language. This completes the description of TYP. 
The additional axiom of extensionality states replacement for defined 
equality: 


EXT, y?=z°Ay?e wll > 77 & lel; 


EXT is “EXT, for allo”. 


9.2. Elimination of the extensionality axioms. TYP + EXT can be em- 
bedded into TYP via a suitable translation, as follows. For each type 0 we 
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define a predicate Ext, (abbreviating Ext ,(t°) as Ext(z’)): 

Ext(t?) = (18 = 1°), 

Ext(t%% ~~ X@-1) s= A Ext( p"), 


i<n 
Ext(t'?l) = Wx°y°(Ext(x°) A Ext(y’) > (xereyer)). 
We now have the following 
THEOREM. Let A,,, be obtained from A by relativizing all quantifiers to Ext, 
and let FV(A) = {x7f°,..., xf"-'}. Then, for A with FV(A) = {x,...,X,} 
TYP + EXT} A = TYPE Ext(x,) A --- AExt(x,) > Aggy 
and similarly with TYP replacing TYP. O 
9,3. Extending the negative translation to TYP‘ + EXT. The extension in 


this case is not as straightforward as for HAS or IQC?’. The root of the 
trouble is in the translation of EXT: EXT® becomes 


which is not obviously true. We can solve the problem, however, by first 


applying the embedding of 9.2, then *. By a routine argument, we prove the 


LEMMA. TYPS + A => TYPE A8, 
from which we readily obtain 


THEOREM. Let FV(A) = {xj,...,X,,}. Then 
TYP® + EXTFA=> 


TYP + Ext(x,)® A --- AExt(x,)* > (Ag,,)®. 


9.4. Heyting’s arithmetic in higher types HAH. In its simplest version this 
theory is based on TYP with only a single basic type 0 (the type of the 
natural numbers); at type 0 we have the language and axioms of HA 
available, and induction holds for all formulas of the language. 

In the next few sections we briefly explore certain (definitional) exten- 
sions of TYP with a more flexible language. 


9.5. Intuitionistic higher-order logic with existence predicate and description 
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operator. The logical basis is IQCE with description operator (cf. section 
2.2). The collection of types is given by the clauses (i)-(iii), moreover we 
shall also permit the empty cartesian product yielding a type denoted by 1. 
We permit constants for each type; atomic formulas are Er’, ty = ¢§, 
re the), 

The axioms and schemas now become 


te © th] > Et, A Et, (strictness of €), 
E(Xq,-++) X,-1) @ Exy A +++ AEX, -1, 


Ez f\ Ep’ (z € 0X --+ Xo,_1), 


i<n 


VXo ae Xn-1 renee: soe = Iz A prz -x)} 


i<n 
( product and projection axioms; z © 0) X +++ X6,_,), 


E(Iy!".x*[x & y @ A(x)]) (comprehension). 


REMARKS. 
(i) The axioms for product and projection can be replaced by 


Ez > ( piz,..., p%_4z) = z, where z € oy X +--+ Xo,_1, 


(0506+) Xpaa) = (Noreen) @ Axi =r 
i<n 
The verification of this equivalence is left as an exercise for the reader. 

(ii) If in the product and projection axioms we take the empty product 1, 
and interpret the empty conjunction as denoting truth T, we find that 
( ) € 1, and also that this is the unique element in 1. For example, taking 
the formulation above, we see ()=()@T, so E(); also Ez > 
() =z. Let 2 := [1] = P(t); we discover that (2 is isomorphic to the 
collection of propositions; see 9.9 below. 

(iii) Each type o corresponds to an existing object o* of type [o] 
(intuitively speaking, o* is the set of ail objects of type o), namely 


o* = IyNx*[xeyoex=x]. 


We can now enlarge our collection of types by introducing the generalized 
types. 
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9.6. DEFINITION ( generalized types). 

(i) {x°: A(x)} = Dy.Vx°[x © y @ A(x), 

(ii) affel x ++ xaleegl em (2: Are qpiz © tl}, 

(iti) P(r!) = (xl: vy(y € x @ y € l?l)}. (Alternative notation {r!*)},) 
Let us use X,Y for sets defined by means of (i), (ii), (iii); these sets 

include the o* mentioned above. We can define in the usual way (assuming 

all variables and terms to have been provided with the appropriate types) 


Wx E X.A(x) += Wx(x EX > A(x)), 
Ix © X.A(x) = 4x(x EX A A(x)), 
Ix € X.A(x) = Ik[x © XA A(x)], 
{x € X:A(x)} = {x:xEXAA(x)}. O 


We can easily establish the following 


9.7. THEOREM. Assume the types to be such that all formulas below are 
well-formed, then we have 
@ {x et: A(x)} © P(r); 
(ii) se {x Et: A(x)} es Eta A(s); 
(iii) {x € t: A(x)} = {x © t: B(x)} © Vx € t(A(x) © B(x)); 
(iV) Ajend Sb Son ce os Spa) E19. X 29 * Koy A 
Mi <nPi(So meres Sn—1) = Sis 
(Vv) 2€ ty X++* Xt @Ajen Piz Et; AZ = (D9Z,.--> Pa-12)} 
(Vi) (Xqs---5 Xn—a) = (Nore +> Yuna) & Ajen®i = Yi 


PRooF. Straightforward, and left as an exercise. O 
9,8. Next we note that the axioms and schemata in 9.5 also hold for general 
types. Take for example the first axiom 

t7 © th] > Et, A Ety. 


Here t, € o*, t, € [o]*. More generally we have for any ¢ such that f, € 1, 
t, € P(t) are well-formed 


t Et ARE P(t) At, €t, 7 Et, A Et), 


and similarly for the other axioms. 
More precisely, we can enlarge our collection of type symbols and extend 
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the axioms and schemata of type theory to them, so as to obtain a 
definitional extension of the theory. We only have to add to the type-form- 
ing operations the following clause: 

(iv) If o is a type, t © P(o), then so is o 1 t. 

Intuitively, o 1 ¢ represents the set of all type-o objects x such that x € 1. 
We shall assume o | ¢ to be part of 0, so that 12 = 13" makes sense. We add 
to the axioms 


EPO s+, =H" [1 =y], 
E(t@") > wer. 
The reader should work out for himself a variant where terms of types 1?" 


and ty are kept syntactically distinct, with the help of an embedding 
operator for type o 1 ¢ into o. The final result is the following 


THEOREM. The extension of type theory with existence predicate and descrip- 
tion operator to the generalized types is definitional. 


Proor. Left as an exercise. OD 


9.9. The isomorphism between propositions and Q = P(1). We return briefly 
to remark (ii) in 9.5. Since the only element of 1 is the empty sequence, -for 
which we shall write 0, we have 


{x € 1: A(x)} = {0: A(0)}, 
and also 
{0: 4(0)} = {0: B(0)} @ (A(0) @ B(O)). 


To each x € P(f) there corresponds a proposition 0 € x, and to each 
proposition A we have a corresponding element {0: A} © P(#). Clearly 


{0:0Ex}=x, x=yo {0:0Ex} = (0:0Ey}. 
9.10. Relations and functional types. n-ary relations are represented as ele- 


ments of the power type of a product type. If X and Y are generalized 
types, we cari define the set of functions X — Y (or Y*) as 


fEY*% = {fEP(XX Y): vue x(Iv.[(u,v) ef] © Y)}; 
the function notation is available since we have the description operator, 


f(u) = Iv € y[(u, 0) € f]. 
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9.11. The use of variable types. One of the disadvantages of type theory in 
the form considered above, in comparison with set theory, is the fact that 
all objects are supposed to be located in a specific type: all possible kinds of 
power-sets may contain topological spaces for example, or the elements of 
groups and fields may be located in any type. 

This disadvantage of the typed language is rather easily removed by 
admitting variables for types in the language, e.g. a, 8... (there are no 
quantifiers over type variables). This permits us to formulate statements for 
arbitrary types in the theory. It is easy to see that such an extension with 
variable types is conservative. 


9.12. Other formalisms of type theory. There are many formalisms of type 
theory possible which are intermediate in character between the system 
TYP + EXT and the system of 9.5. For example, one may consider systems 
without partially defined terms, but with a liberal supply of types; if 
comprehension types are present, one then has to provide for the possibility 
of empty types (cf. Boileau and Joyal 1981). 


10. Notes 


10.1. Primitive recursive functions. The class of primitive recursive func- 
tions is introduced in Gédel (1931) (under the name “rekursive 
Funktionen”). Even after the discovery of the general recursive functions 
the primitive recursive functions have retained their interest as an im- 
portant subclass and have been the subject of many investigations. 


10.2. Primitive recursive arithmetic. Skolem (1923) develops primitive re- 
cursive arithmetic in an informal setting; Skolem’s work is extensively 
discussed in Hilbert and Bernays (1934), section 7, as an example of 
finitistic reasoning. A neat formalism for PRA is presented by Curry (1941), 
who also gives a proof that the addition of propositional logic to the purely 
equational part of the theory is conservative. The latter result is also found 
in Goodstein (1945) (written in 1941).'An extensive and detailed treat- 
ment of primitive recursive arithmetic is to be found in the monograph 
Goodstein (1957). 

The strong version of induction in primitive recursive arithmetic stated in 
2.5, as well as its equivalence with the ordinary induction rule, is due to 
Skolem (1939), with a correction by Peter (1940). 
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Further references to work on recursive arithmetic are given in notes at 
the end of Goodstein’s monograph, and in the introduction to the transla- 
tion of Skolem’s paper in van Heijenoort (1967). 

PRA as a system is mathematically more powerful than one would at first 
expect, see e.g. Mints (1976), Sieg (1985). 


10.3. Intuitionistic first-order arithmetic. The system HA is not singled out 
as a subsystem in Heyting (1930A, 1930B). A system H’ which is virtually 
the same as HA appears in Gédel (1933); our version of HA coincides with 
version C mentioned in Kleene (1945), section 4. 

In Kleene (1952) quite a lot of theorems of elementary arithmetic are 
proved in HA. 

The natural deduction version of the rule of induction is found in Prawitz 
(1971) and Jervell (1971). 


10.4. Algorithms. As noted in 4.3, it is not the precise formalization in 
arithmetic which matters for “algorithmic mathematics” such as CRM, but 
rather a few essential properties such as the existence of a universal binary 
algorithm for all unary algorithms, and a decidable computation predicate 
(such as Kleene’s T). A nice exposition based on these principles has been 
given by Richman (1983). For an example of an axiomatic approach to the 
notion of computation see e.g. Fenstad (1980). 


10.5. Disjunction and explicit definability properties. These are perhaps the 
most widely known and investigated metamathematical properties of for- 
mal systems based on intuitionistic logic. The general form of the explicit 
definability property is 


ED + JaA(a) >+ A(t) fora suitable 1, 


where a and ¢ are of the same sort. 

Gédel (1932) states the DP for IPC without proof; a proof for IQC is 
indicated in Gentzen (1935). The explicit definability property ED was 
presumably known to Gentzen (cf. Heyting 1934, page 18) and is certainly 
implicit in Gentzen (1935), since the same method which yields DP for IPC 
also gives ED for IQC. 

Kleene (1945, end of section 8), and in greater detail in Kleene (1952, 
theorem 62(b)) gives a proof of EDN and DP for HA, using a variant of 
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realizability. In Kleene (1962) the Kleene slash was introduced (|* of 
E3.5.3), and used to obtain DP and EDN under implication for suitable 
premisses. 

The Kleene slash is closely related to the Aczel slash | (Aczel 1968), on 
which our exposition in section 5 is based (cf. E3.5.3); both yield a 
characterization of the premisses for which relativized EDN and DP hold in 
IQC, IPC and HA (Kleene 1962). 

The class 2# appears in Rasiowa (1954, 1955) for IQC and in Harrop 
(1960) also for HA. More details are given in the introduction of Harrop 
(1956) and in the footnotes to Kleene (1962). 

The Kleene slash has been generalized to stronger systems, see 
e.g. Friedman (1973, 1977), and has also model-theoretic counterparts (cf. 
Smorynski 1973, de Jongh and Smorynski 1976, Aczel 1968, van Dalen 
1984 and the discussion in 13.7.7). It is a powerful tool, see e.g. Friedman 
and Scedrov (1983) for a recent application. 

The fact that all the known proofs establish EDN and DP simultaneously 
is no mere coincidence: Friedman (1975) showed that for systems contain- 
ing a certain minimum of arithmetic, DP implies EDN. For more informa- 
tion about generalizations of DP and ED see also 10.5.3-5. 

Many. authors have treated ED and DP as necessary (and sometimes also 
as sufficient) conditions for a system to be “constructive”. This view seems 
to be ill-founded: for example, if a formal system S proves A V B and is 
constructively well motivated, the formal proof in S corresponds to an 
informal constructive proof of AV B and hence there ought to be 
an informal constructive proof of either A or B; but it does not follow that 
this informal proof of A or B should have a formal counterpart in S; that 
only follows if S has suitable closure properties. In Troelstra (1973A) an 
example, due to Kreisel, is presented of an intuitionistically correct exten- 
sion of HA which does not satisfy EDN. 

ED and DP are also not sufficient for determining the constructive 
meaning of V and I relative to the other logical operators, as may be seen 
from the fact that both HA + IP and HA + CT, + MP satisfy EDN and 
DP, but are inconsistent when taken together (Troelstra 1973A). Here IP is 
the schema 


IP (44 > 3xB) > ax(A > B) (x € FV(A)), 


CT, and MP are defined in 4.3.2 and 4.5.1 respectively. Another example 
has been given in J.R. Moschovakis (1981). 
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10.6. Markov’s rule. The particularly elegant proof presented here is due to 
Friedman (1978) and, independently, to Dragalin (1980). The earliest proof 
is in Kreisel (1958), remark 6.1. Before the Friedman-Dragalin argument 
became available proofs of Markov’s rule were based on a variety of more 
complicated methods (see e.g. Troelstra 1973, section 3.8, Smorynski 1973, 
Girard 1973). For a generalization of Friedman’s argument, see Leivant 
(1985). 


10.7. The formalization of elementary recursion theory. The formalization as 
outlined here is based on three elements: 1° Kleene’s method of indexing, 
introduced in Kleene (1958), 2° the use of (elementary) inductive defini- 
tions, and 3° E-logic with terms constructed with Kleene brackets, relying 
on the fact that the introduction of symbols for provably partial functions 
with the appropriate characterizing axioms produces definitional extensions 
(2.7.6). Kleene’s monograph (1969) treats the formalization in great detail, 
but does not make use of 2° or 3°; the treatment of Kleene brackets in 
Kleene (1969) is ad hoc. Section 6 is an expansion of Troelstra 1973, section 
1.4. 


10.8. Higher-order logic. The relativization of theories with extensionality 
to theories without extensionality has been used by many authors: see e.g, 
Luckhardt (1973) and the references given there. 

Kreisel (1968A) showed how to extend the Gédel-Gentzen negative 
translation to second-order logic with set variables, Myhill (1974) extended 
the translation to simple type theory with set variables. 

For the type theory with E-logic in 9.5-8 we followed Scott (1979). The 
device of variable types is used, for example, by Girard (1971). 


Exercises 


3.1.1. Write down the “defining equations” according to the official definition, for the 
functions in 1.3 (demonstrated in 1.3 for the cases of a constant function and addition). 


3.1.2. Prove 1.4, 1.6. 


3.1.3. Prove 1.10, 1.12, and the assertion at the end of 1.11. 
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3.1.4. Prove 1.14. 


3.2.1. Prove in PRA x —(x + y) = y —(y — x) and (x + y) + y =x (Goodstein 1957, ex- 
amples II, 2.4.21, and formula 2.6.11). 


3.2.2. Define the “integer-square-root” function R by RO = 0, R(Sx) = Rx + (1 ~(SRx)? 
+ Sx))(so Rx is the maximal y such that y? < x) and prove Sx ~(SRx)? = 0,(Rx)? + x= 
0 (Goodstein 1957, 1.6, examples II 2.8.3). 


3.2.3. Introduce a (non-surjective) pairing function P with inverses P,, P, given by P(x, y) = 
(x+y)? +x, Piz =2-~(Rz)?, Pz = Rz~ Pyz, where R is taken from E3.2.2. Prove 
P, P(x, vy) = x, P, P(x, y) = y (Goodstein 1957, chapter 7). 


3.2.4. Use the existence of pairing with inverses to reduce definition of p; (1 <i < m) from 
Winx: (1 <i <n) by 


oe - ¥i(%), 
9; (Sz, x) = xi(z, 91 (2, X),.--, (2, X)s x) 


(definition by simultaneous recursion) to ordinary recursion. 
3.2.5. Generalize proposition 2.5 to 
A(O. Fs ACH, G1 (4 Foe (2 F)) 7 A(SHF) = A(tos7), 
and give a corresponding generalization of proposition 2.6. 
3.2.6. Prove 2.9. 
3.2.7. Give a formalization of the proofs of 2.4(ii)-(iv) in a linear natural deduction style. 


3.2.8. Show that 0 # SO is not provable from the other axioms in PRA if 1 is treated asa 
primitive symbol. 


3.3.1. Show that in HA proofs for 

Q) x=yVx#y, and 

(lil) x=yVx<ypVy<x 

can be given which are simpler than the proofs of these facts presented in section 2 in the 
context of PRA. 


3.3.2. Prove in HA 1.10, 1.12. 
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3.3.3. Give a formalized proof of the strong induction rule (2.5) in HA. 


3.4.1. Let y, x, @ be recursive. Show that then also » given by 


hae =4(x), 
p(Sz,¥) =x(z,%, 9(z, 6(z, ¥))) 


is recursive. 


3.4.2. Let (f;), effectively enumerate all partial algorithmic functions, and let J be any finite 
subset of N with f; = f for all j © J. Then for some i € J f, = f, and i can be found by an 
algorithm (see e.g. Richman 1983). 

Hint. Let @ be a total function such that f,,, =f if f,,(m) is defined, f,,, totally undefined 
otherwise; then show that for some m 


Om € J iff f,,(m) defined. 


If 0(m) € J, f,,(m) is defined, fy, =f; take i = @m. 
If 0(m) € J, fo,, =f totally undefined; then consider @’ such that fg, =Ax.1 if f,,(m) is 
defined, otherwise fy.,, everywhere undefined, etcetera. 


3.5.1. Prove the lemma in 5.7. 


3.5.2. Check the remaining cases in the proof of 5.9. 


3.5.3. The “Kleene slash” is formulated similarly to the Aczel slash: (i), (ii) as before, and 
(iii) P*A V Be= (IFA and + A) or (I'[* B and + B), 

(iv) C}*A > B= (IA and FA) > TI*B, 

(v) PX VxB(x) = C\* A(X) for all n, 

(vi) C'|* 4xB(x) = P'|* BCH) and I+ B(n) for some n. 

Show 


(T[*A and + A) @ IA. 
3.5.4. Complete the proof of 5.12. 


3,5.5. Give the proof of 5.16 in full. 


3.6.1. Bring the closure conditions for examples 1 and 2 in 6.4 in the standard form indicated 
by (7) via (5), (6). 


3.6.2. For the predicate Q of 6.4 write down a formula A(x, z) expressing that z © Q by a 
tree of depth x. Show that 4xA(x, z) satisfies the closure conditions and is minimal. Do the 
same for example 1. 
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3.6.3. Show, arguing classically, that the definition of P, as the first [f for which [= e+) 
satisfies (9) in 6.4. 


3.6.4, Complete the proof of 6.7, and the proof of 6.6 (in 6.9) (Troelstra 1973, 1.4.3, 1.4.4). 


3.6.5. Write down explicitly an arithmetical predicate F(x, z) expressing “x is a correct tree 
for z” (Troelstra 1973, 1.4.5). 


3.6.6. Fill in the details of the proof of 6.11 (Troelstra 1973, 1.4.5). 


3.7.1. Bring the other inductive clauses in the definition of 2 (lemma 7.3) also in standard 
form. 


3.7.2. Prove closure under minimalization for the partial recursive functionals (7.5(iii)). Hint. 
Let y be recursive such that {Y(z9)}(x, J) = {29 }(Sx, J), and apply the recursion theorem 
to the partial recursive {z} such that {z}(xo, x,,~)=0 if {x,}(0, y) = 0, and 
= {Xo }(H(q)) + 1 if (24 }@, y) > 0. 


3.7.3. Let the primitive recursive functionals be generated by 20-5 and 


28 if (n, m, m) © Q then (, (3, k + 1, x, y), (0) * m) € Q; 
if (n, (3,k + 1,x, y),(z)*m) € Q and (n’, y,(n,z)*m) EQ 
then (n’, (3, k + 1, x, y),(Sz)*m) € Q. 


Show that there is an index for a primitive recursive function representing Sb(x, y); show also 
that the primitive recursive functionals are total. 


3.7.4. Prove the second fact in 7.6: 


T(x,a,m,u) A T(x,a,m,u') sua’, 


3.7.5. Check the second and third property of T* in 7.7. 


3.7.6. Prove 7.11(ii), 7.13(ii), 7.14(ii). 


3.8.1. Check in detail that the addition of propositional variables with full comprehension for 
propositions is conservative over the system of IQC? with propositional variables deleted (8.2). 
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3.8.2. Complete the proof of the definability of 1, A, V,3 in 8.3. (Prawitz 1965, chapter V, 
theorem 1). 


3.8.3. Prove theorem 8.6. 
Hint. (i) Let us suppose, for simplicity, to be working in HAS and choose (N,0,S) for 
(X, c, f). Now define 


¥, (u,v) = VZ?((0,¢’) E Z7 A 


vaxy((x, y) € Z? > (Sx, fy) € Z*) > (uv) e€ Z?). 


With induction we can show that ¥, is a relation with functional character. 

(ii) (In IQC?) relying on (i) we let Ay.x + y be the isomorphism from the Peano-structure 
(X, c, f) onto (X, x, f) (for any x € X), and we let Ay.x-y) be the isémorphism from 
(X, c, f) onto (X, c, Ay.x + y) etcetera (Henkin 1960). 


3.8.4, Show that the negative translation can be applied directly to HAS formulated with 
comprehension terms. 


3.8.5. Show that HAS® and HAS prove the same I9-formulas, e.g. by applying 2.3.16-18 to 
HAS as embedded in IQC?. 


3.8.6. Show in detail that 2%” as constructed from ¥ in the proof of 8.9 indeed validates 
comprehension (de Jongh and Smorynski 1976). 


3.8.7. Consider the following conservative extension H of HAS: we add constants Cg, for 
each formula B of H not containing free set variables, and each set V C N, and we add axioms 


(%150069 Xn) © Ce & BX, Xn)s 


where FV(B) = {x,,..., x, }. We now define the following extension of the slash (cf. 5.7) for 
sentences of H, as follows 
G) |\(t=s)=Fre=s; 
[(tseess tn) © Ce y = (hb... lt) € V (here |z| is the value of ¢ in N); 

(ii) the clauses for A A B, A V B, A > B,WxA and 3xA are as before; 
(iii) |VXA(X) = for all Cp [A(Cp,y); 
(iv) |3XA(X) = for some Cg, |A(Cz, 7) 
(deducibility refers to deducibility in H). 

Prove the soundness theorem in the following form 


HAS + A =|A 


for all sentences A. 
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Derive from this the numerical existence property and the (closed) set-existence property 
(explicit definability for sets) in the form 


HAS + 3XA = HAS A[X/Ax.B], 


where FV(B) = {x}, A not containing free set variables (Friedman 1973; Troelstra 1973, 
3.1.19-23), 


3.9.1. Prove the theorem in 9.2 (Myhill 1974, Luckhardt 1973, chapter 2). 
3.9.2. Prove the lemma and theorem in 9,3 (Myhill 1974). 
3.9.3. Prove the equivalence in remark (i) of 9.5. 


3.9.4. Prove theorem 9.7 (Scott 1979). 


3.9.5. Give the details of the theorem in 9.8; devise a formulation where types o and o | ¢ are 
treated as disjoint, and ¢? and 1°" are regarded as syntactically distinct (Scott 1979). 


3.9.6. Show that the following yields a formal system equivalent to TYP + EXT: 2 itself is 
regarded as a type; formulas are terms of type Q, and if t,, t, are terms of type o, f, = ty isa 
term of type 2. If ¢ is of type 2, we regard ( ) € t as synonymous with f itself (( ) is the 
empty sequence); T is a term of type 92. We have the axioms 


A=>A; T= x=x; ANx=1= A[x/t]; 
A=>xXE{(X):A}, KE {(X): A} >A, 
where = A is short for T = A. We also have the rules 


A>B,B=>C A>B,A>C A>BAC A>BAC 
A>C A>BAC A=>B A=C 


A=>B AA(X) EL P(X)EQ AAT) ELH >(X) EK 
A[x/t] => B[x/t] A>h=t 
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and we define 
WxA = ({x: A} = {x: TH), 
A> B=(AAB=A), 
AV B:=Yw"((A 9 w) A(B> 0) > w=T), 
L:=Vwo(w=T), 
3xA = Vo" ((VxdA > w) ew =T) 


(Boileau and Joyal 1981). 


CHAPTER 4 


NON-CLASSICAL AXIOMS 


The present chapter is devoted to a discussion of several principles 
(axioms) which play an important role in various forms of constructive 
mathematics. The most interesting ones among such principles are those 
which clearly express that we have an interpretation of logic in mind which 
differs from the classical reading; in other words, principles which are 
formally incompatible with classical logic. The main examples of such 
principles are various (intuitionistic) forms of Church’s thesis (sections 3, 
4), (partially) expressing the idea that “all operations in mathematics are 
algorithmic” (corresponding to “Constructive Recursive Mathematics” 
CRM) and continuity principles for choice sequences, originating in the 
adoption of non-predeterminate sequences as legitimate objects of intuition- 
istic mathematics (section 6 and part of section 7). 

Besides these “non-classical”? principles there are also some principles, 
which are valid on a classical reading of the logical operators, but which in 
a constructive setting are in need of a different justification, and which, 
moreover, are mathematically interesting in combination with the non- 
classical principles mentioned above. The principal examples are axioms of 
countable choice (section 2), Markov’s principle (section 5), the compact- 
ness of the Cantor discontinuum (section 7), bar induction (section 8), 
Kripke’s schema (section 9). Section 9 also discusses the uniformity princi- 
ple, which is classically false. 

The reader can proceed in at least two different ways. The sections may 
be read consecutively, giving an overview of important principles of con- 
structive mathematics. It is also possible to postpone reading the various 
sections until needed for mathematical applications. Sections 8 and 9 of the 
present chapter hardly play a role in chapters 5-8 on constructive mathe- 
matics. 
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1. Preliminaries 

1.1. NOTATIONS AND CONVENTIONS. Unless stated otherwise, variables 
ranging over the natural numbers are represented by ordinary lower case 
letters. Greek lower case letters a, 8, y,6 are metavariables for variables 


ranging over N — N, roman capital letters X,Y, Z for set- or relation 
variables. We define 


a = B= Vx(ax = Bx), a < B= Vx(ax < Bx). 
For initial segments we use 
aO=()=0, a(x+1) = ax*(ax), 
a En:= (a(Ith(n)) = 7) = 4x(ax = n) (a@ has initial segment 7). 


Concatenation can be extended to concatenation of a finite sequence with 
an infinite one: 


(n), if x < Ith(n), 


(n*a)(x) = ie ~|th(n)) if x > Ith(n). 


We may think of n* a as “n followed by a”. For predicates of sequences 
we shall assume extensionality: 


A(a) Aa= B= A(B). 


We shall use notations such as t € X, X(t) (or even Xt, if no confusion is 
to be feared) interchangeably. O 


1.2. DEFINITION. We define a tree T as an inhabited, decidable set of finite 
sequences of natural numbers closed under predecessor; so T is a tree iff 


(YET, Va(neTvVn€T), 
Vam(n€ TAm<n>meéT). 

A spread is a tree in which each node has at least one successor: 
Wn © TAx(n*({x) ET). 


A finitary spread or fan is a finitely branching spread, ie. a spread 
satisfying 


Wn © TAzWx(n*(x) € T> x <2). 


Sect. 1] Preliminaries 187 
A sequence @ is a branch of the tree T iff all initial segments belong to T: 
a € T:=Vx(ax ET). 


For quantifiers over finite sequences ranging over a tree T we shall often 
write Vn © T or Vn, dn © T or Jn,; similarly we use Va7,Ja;>. O 


1.3. DEFINITION. The universal tree or universal spread T,, consists of all 
finite sequences of natural numbers; so modulo coding 7,, = N. The infinite 
branches of T,, constitute NN, 

The binary tree consisting of all 01-sequences will be denoted by 7,,. T’ 
is a subtree of T iff T’ Cc T and T’ isatree. O 


1.4. LemMa. Let T’C T, T’ and T spreads. Then there is a mapping 
I< T-T' such that 

(i) Ith(P'n) = ith(n) for alin € T, 

Gi) Pn =n forn eT’. 

N.B. I can be extended to infinite branches by Pa := x.(Ta(x + 1)),; then 
T(la) =a fora eT, Ta =a fora eT’. 


Proor. Exercise. O 


1.5. DEFINITION. A topology on a set X can be defined, as usual, as a 
family 7 of subsets of X, containing 8, X and which is closed under finite 
intersections and arbitrary unions. A basis @ for a topology 7 on X is a 
family @ Cc J such that 


VWYET(Y=U{X:XEBAXCY}). a 


Observe that for a basis #: 
ifUEB,UEB,xEUNU' then3U” € B(x EU" CUNU’); 
ifx € X thn JU E€ B(x EU). 


A spread T can be given a topology by taking the infinite branches as 
points, and as basis the sets 


V,= {a:aEGnAaeET} foreachne T. 


Thus 7), as a topological space is 2“, the Cantor space. T,, is NN, the Baire 
space, with the product topology obtained from the discrete topology on N, 
as in the classical theory. (For more information about constructive topol- 
ogy see chapter 7.) 
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A topological space is a pair (X, 7 ) consisting of a set X with a topology 
ZF on X; the elements of 7 are the open sets of the space and the elements 
of X are the points of the space. We define the interior of a set Y C X as 


Int(Y) =U{vVeT:VcY}. 


A continuous mapping from a space I’ to a space I” is a mapping for 
which the inverse image of an open set is always open, or in terms of bases: 
the inverse image of a basis element is a union of basis elements. A 
homeomorphism from a topological space [' to a space I” is a continuous 
bijection with continuous inverse. 

Thus, on spelling it out, continuity for 6 € N“ > N and ¥ e NX > NN 
is equivalent to 


VadIxVB € ax(®a = 6B), and 
VxVayvB € ay((¥a)(x) = (¥8)(x)), 
respectively. The second line states that Aa.( Ya)(x) is continuous for all x. 


Oo 


1.6. Axiomatic basis. Most of the discussion is informal, i.e. no particular 
formalism is presupposed. The principles of arithmetic, in particular induc- 
tion, will be tacitly used; for functions we shall assume function compre- 
hension 


AC! = Wxd!yA(x, y) > JaWxA(x, ax), 
ice. if A(x, y) is a binary relation with functional character 
vxyy'(A(x, y) A A(x, y') oy =y') A WxdVA(x, y), 


then there is a function with its graph given by A. 
As to the existence of sets, as a rule we shall accept comprehension in the 
form 


VXVx[x © Xo A(x)] (X not free in A) 


for A constructed from prime formulas, propositional operations, quan- 
tifiers over natural numbers and quantifiers over number-theoretic func- 
tions, By the existence of pairing functions this implies the corresponding 
principle for n-place relations: 


AX" Vx[K eX" © A(xX)]. 
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We also accept a corresponding principle for sets of number-theoretic 
functions: 


aXVala eX @ A(a)| 


under the same restrictions as above. The use of impredicative comprehen- 
sion, where also set- and relation quantifiers are permitted in the construc- 
tion of A, will be noted explicitly (cf. also section 3.8). 


2. Choice axioms 


2.1. Axioms of countable choice. The simplest axiom (schema) of countable 
choice is AC-NN, also frequently denoted by ACg) 


AC-NN = WndmA(n, m) > JaWnA(n, an). 


Informally, this schema may be justified as follows: a proof of the premiss 
should provide us with a method to find, for each n € N, an m € N such 
that A(n,m); such a method is nothing else but the description of a 
function assigning the required m to n. 

In the same manner one can justify the more general countable choice 
axiom AC-N (= AC,) for arbitrary domains D: 


AC-N Wndd € DA(n,d) > Ig EN>DVnA(n, pn). 

The principle 
AC-NF = WndaA(n, a) > 38WnA(n,(B),,) 
(also denoted by AC,,), where (8), := Am.B(j(n, m)), is a special case of 
AC-N, since a mapping p € N > N% may be coded by a BEN ON 
such that B( j(n, m)) = y(n)(m). 

It might be tempting to assume the more general axiom AC for any two 
domains D, D’: 


Vx © Day € D’A(x, y) > Ig © D> D' Vx € DA(x, px). (1) 


However, we assume that domains of quantification always come equipped 
with a notion of equality on them, and although a proof of the premiss of 
(1) must contain a method for transforming any proof of x € D into a 
proof of Jy € D’A(x, y), which implies that we must have a method for 
extracting, from the proof that x € D,a y € D’ such that A(x, y), there is 
no guarantee that such a method respects the equality =p) on D. For 
example, consider the true statement 


Vx € Ran © N(x <n), (2) 
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There is no continuous function assigning n to x, because such a function 
would have to be constant; on the other hand we have no hope of giving an 
example of a discontinuous function from R to R (cf. the discussion in 
1.3.6). Also, free use of set comprehension and AC permits us to derive 
PEM (exercise 4.2.1). 

Note however, that if we represent reals by fundamental sequences of 
rationals with a fixed rate of convergence, i.e. sequences (7, ),, such that e.g. 
Vam(|r,, — T,4m| < 27"), then we can assign to each (r,), an m EN by 
means of a function @ such that for the real x determined by ¢,,,),, we have 
x < m(take p((r,),,) to be the least natural number k such that 79 + 1 < k). 

The situation of example (2), where AC fails, may be described in more 
general terms as follows. 

=p is a defined notion, i.e. there is an underlying set (D”, =,,,) such 
that D consists of equivalence classes of elements of D” modulo some 
equivalence relation =~ . Then the method implicit in Vx € Day € D’ 
perhaps respects =), , but not necessarily =p . In example (2) above D” 
is the collection of fundamental! sequences. 

Another at first sight surprising aspect is the following. Suppose that 
(D, =p) is obtained as a subset of (D”, =p.) and that =p is the 
restriction of =p, to D. Now one might expect that if AC-D” holds, then 
AC-D also holds where the g in AC is the restriction of a function defined 
on D”. However, the method p given by the truth of the premiss of AC can 
act not only on the element d € D C D” as such, but also use information 
from the proof that d © D” actually belongs to D. As a result, p is not 
necessarily the restriction to D of a partial function from D” to D’. In this 
situation even 


AC-DD") Wx € Daly € D’A(x, y) > Ig € D> D' Vx € DA(x, x) 


may fail. We shall encounter this situation in 4.11. 

So why is AC-N acceptable? Primarily because n € N is immediately 
given to us, it is not in need of a proof. No extra information is needed to 
tell us that n, as an object constructed in a canonical way from units is a 
natural number. 

It should be noted on the other hand that AC-NN is only justified in case 
our notion of function is as wide as possible, ie. embraces every possible 
method for assigning numbers to numbers. 


2.2. The presentation axiom. Reflection on the reasons of the failure of the 
general axiom of choice, described above, suggests a principle called the 
Presentation axiom: each set D is the surjective image of an underlying set 
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D” such that AC-D’D’ holds for arbitrary D’. We shall not make use of 
this principle in our constructive mathematics. For arithmetic it can be 
justified by recursive realizability (end of 4.12); it also crops up in connec- 
tion with constructive set theory CZF (11.8.25). 


2.3. Axioms of dependent choice. Let x and y range over some domain D, 
then the axiom of dependent choices is the principle 


DC-D  Vx3yA(x, y) > 
Vxdp E N> D[90 = x A Wn A(n, o(n + 1))). 


Informally, we may justify DC-D as follows: let x € D; put p0 = x; find y 
such that A(@0, y), put ol = y; find z such that A(ql, z), put p2 = z etc. 
For DC-N (resp. DC-NY) one often writes DC, or DC-N (resp. DC,). 

An even more general principle is the axiom of relativized dependent 
choices over D 


RDC-D Vx € D[ Ax > 3y € D( B(x, y) A Ay)] > 
Vx € D[ Ax > 


Jp € N>D(g0 = x A Wn B(gn, p(n + 1)))]. 
Some logical relationships are given in the exercises. 


2.4. REMARKS (on terminology). In the literature one frequently encounters 
the distinction between “operation” and “function”, as well as the terms 
“extensional” and “intensional” applied to operations and functions. 

Given two domains D,D’, the term function is reserved for an assign- 
ment of values in D’ to the elements of D which respects equality, i.e. 
xX =p) Y > PX =p vy for all x, y © D; an operation need not respect =p 
but perhaps only a narrower relation of equality between the objects in D 
(cf. also the discussion of the presentation axiom above). 

Informally, a notion of equality is said to be extensional, whenever the 
equality is defined as in the corresponding classical set-theoretic setting. A 
function is extensional if it respects extensional equality on its domain (in 
other words, if its domain is supposed to be provided with extensional 
equality). For example, a = B := Vx(ax = Bx) is the usual extensional 
equality between functions, and an extensional functional ® on N —-N 
with values in N is an operation on N > N which respects extensional 
equality on N > N. 

We use, informally, the terms non-extensional or intensional if the equality 
or function/operation is not extensional in the sense just described. A 
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non-extensional function does not respect extensional equality but a nar- 
rower equality relation, e.g. when all functions in N — N are assumed to be 
given by algorithms and a functional ® on N —WN respects equality 
between algorithms only. 

Of course, if algorithms describing functions a, 8 are given by code 
numbers, one might avoid talking about “non-extensional equality” be- 
tween a and 8 by working with the code numbers of a and £; in other 
words, we might make a notational distinction between the algorithms and 
the functions described by them. However, in metamathematical researches 
it is not always convenient to do so (cf. chapter 9). 

In specific cases the precise sense of “non-extensional”, “‘extensional” 
will always be clear from the context. 


3. Church’s thesis 


3.1. Church’s thesis CT. If we think of all our mathematical objects (predi- 
cates, operators etc.) as completely determined, fully described, we are in 
the realm of “lawlike” constructive mathematics. In particular, this means 
that elements of N > N are /awlike, i.e. completely determined by a law 
given in advance. 

The idea of lawlikeness may be strengthened further by identifying 
“lawlike” with “recursive”. The identification can be motivated in two 
different ways: 
foundationally, by regarding the identification of lawlikeness with recursive- 
ness as intuitively justified, on the basis of an analysis of the notion of 
lawlikeness; 
pragmatically, by treating “recursive” as a mathematically precise concept 
which can replace the informal notion of lawlikeness in mathematical 
developments. 

The difference between these is one of emphasis: on the foundational 
approach, the arguments in favour of “lawlike = recursive” carry much 
more weight than on the pragmatical approach. 

The identification of “lawlike” with “recursive” for functions can be 
expressed by an axiom (““Church’s Thesis”) 


CT VaaxVy3z(Txyz A Uz = ay). 


For a striking consequence of CT see E4.7.7: Cantor space and Baire space 
are homeomorphic. 
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3.2. The arithmetical form of Church’s thesis CT,. Taken by itself, CT may 
be regarded as a definition of the notion of “number-theoretic function”, 
by identifying N™ with the collection of all total recursive functions. This 
changes if we combine CT with AC-NN; then we obtain the following 
schema 


CT,  WnamA(n, m) > AkWnim[A(n, Um) A Tknm]. 


CT, is called the arithmetical form of Church’s thesis, since only numerical 
quantifiers appear in it. 

The acceptation of CT, is equivalent to the assumption that the function 
implicit in an intuitionistic proof of a Vndm-statement is always recursive. 
Recall, on the other hand, that AC-NN was regarded as plausible precisely 
because of the fact that any method for assigning numbers to numbers was 
regarded as a function. AC-NN may still be regarded as rather plausible if 
we think of our mathematical universe as consisting of lawlike objects only. 
Note, that if we adopt CT, it means that we also have to accept the 
identification of lawlike with recursive. If we feel convinced by the argu- 
ments in favour of an identification of “‘lawlike” with “recursive”, we have 
no reason to doubt the laws of intuitionistic predicate logic, even if we 
adopt CT). If, on the other hand, we do not regard the identification of 
“Jawlike” and “recursive” as justified, CT) imposes a reading of quantifier- 
combinations ““Wndm” stronger than what is required by the BHK-inter- 
pretation. This leaves us with a problem: are the laws of intuitionistic 
predicate logic still valid, if we read all combinations Vn4m in the stronger 
sense dictated by CT,? (N.B. This is not obvious: if in implications 
VnimA(n, m) — Wn'dm'B(n', m’) the meaning of both premiss and con- 
clusion is changed, the new interpretation of the implication is not neces- 
sarily comparable with the original one.) 

For the time being, however, we shall tacitly assume that adopting CT) is 
compatible with intuitionistic predicate logic; in the next section a meta- 
mathematical justification for this fact is discussed. 

In view of (A V B) @ 34z((z =0 > A) A(z #0 B)) we can for- 
mulate a version of CT, for disjunctions 


CT,Y = Wx( Ax V Bx) > 
da € TREC Vx((ax = 0 > A) A (ax #0 > B)). 


3.3. CT does not conflict with classical logic unless certain function- 
existence principles are present, such as “countable (function-) comprehen- 
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sion” 
AC-NN! — Wn3!mA(n, m) > JaVnA(n, an); 


classically AC-NN! establishes the existence of non-recursive functions, for 
example, an a satisfying 


= 1 if ImTnnm, 
aa 0 if =dmTnnm. 


Since CT, implicitly contains AC-NN, we have an almost immediate 
conflict with classical logic: 
3.4. PROPOSITION. 

HA + CT, + =Wx[adyTxxy V AAS yTxxy]. 


Proor. Assume Wx[4yTxxy V —AdyTxxy] and apply CT,’; we find a 
recursive a such that 


Vx((ax = 0 > “dyTxxy) A (ax # 0 > —4yTxxy)); 
hence Vx(ax = 0 @ =JyTxxy), and this would make JyTxxy recursive. 
Contradiction, by 3.4.4. O 
COROLLARY 1. Assuming CT), =V-PEM holds. 


Proor. Let the a in Va(A4dx(ax = 0) V —-3x(ax = 0)) range over the 

characteristic functions y, of Txxy as predicate in y. O 

COROLLARY 2. Assuming CT , the following schema of predicate logic 
Vx~>4A(x) ~ —AWxA(x) 


is not generally valid. 
Proor. Take A(x) = 4 yTxxy V -=-dyTxxy. O 


REMARK. V-PEM is implied by 3-PEM, hence .V-PEM — —J-PEM, where 
3-PEM = Wa(3x(ax = 0) V Adx(ax = 0)). 


Bishop (1967) called PEM “the principle of omniscience”, and 41-PEM “the 
principle of limited omniscience”. Many well-known theorems of classical 
analysis do not require full PEM, but only 3-PEM. 
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3.5. PROPOSITION. Assuming CT), we can show =VxA(x, a, B) for suitable 
a, B, where 


A(x, a, B) = A(Ay(a(x, y) = 0) A ay(B(x, y) = 0)) 

> -4y(a(x, y) = 0) Vv -ay(B(x, y) = 0). 

Proor. Let X '!= dy(a(x, y) = 0), Y:= dy(B(x, y) = 0) represent two 
disjoint, r.e., recursively inseparable sets (e.g. by taking for a, B the 
characteristic functions of Txxy A Uy = 0, Txxy A Uy = 1 respectively; 


cf. proposition 3.4.5). Then the premiss of A(x, a, 8) holds for all x, so if 
VWxA(x, a, 8), then 


Vx(-3y(a(x, y) = 0) v aay(B(x, y) = 0)); 
with CT,” we find a recursive y such that 
v(x) = 0 > Aay(a(x, y) = 0), 
v(x) # 0 > way(B(x, y) = 0), 
and this contradicts the recursive inseparability of X and Y. O 


3.6. COROLLARY. The logical schema 
(A A B) > (4A V AB) 
has been refuted by the preceding proposition in its universally quantified form 
aVx[-(A(x) A B(x)) > (4 A(x) V AB(x))], 
with A(x) = Ay(a(x, y) = 0), B(x) = 3y(B(x, y) = 0); a fortiori we have 
refuted it in the form 
“SEP = —VWaB[=(4y(ay = 0) A 3y(By = 0)) 
> (Aay(ay = 0) Vv =ay(By = 0))]. 


4. Realizability 


4.1. Kleene (1945) proposed realizability as a systematic method of making 
the constructive content of arithmetical sentences explicit. Kleene’s starting 
point was that an existential statement B = 3xA(x,,...,x,,x) is an “in- 
complete communication”: one can only assert B, if x can be produced 
(depending on the parameters x,,..., x,,). Similarly, one must be able to 
decide between the disjuncts in order to assert a disjunction. 
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If one now also assumes that all existential and disjunction statements 
have to be fulfilled recursively in the parameters, one arrives at Kleene’s 
notion of recursive realizability. 
In the form in which it is given by Kleene, realizability is comparable to a 
classical truth definition: it interprets sentences of arithmetic in the in- 
tended structure “= (N,0, S, +,-,...) of the natural numbers. Let us use 
n,m,... for arbitrary elements of N, and let 7, m,... be the corresponding 
numerals in the language of arithmetic. “‘n realizes A” is defined by in- 
duction on the complexity of A; n is a number hereditarily coding all 
information concerning the explicit realization of existence quantifiers and 
disjunctions occurring in A. The clauses are 
(i) 7” realizes (t = s) iff “t= s” holds in W; 
(so if t = s holds, any number realizes it; the truth of a prime sentence 
gives no interesting information on the realization of 3 or V); 

(ii) mn realizes A A B iff j,n realizes A and j,n realizes B; 

(iii) n realizes A V B iff j,;n =O and j,n realizes A, or jn #0 and j,n 
realizes B; 

(iv) n realizes A > B iff for each m realizing A, {n}(m) is defined and 
realizes B; 

(v) 7 realizes 4xAx iff j,n realizes A[x/ j,n]; 

(vi) n realizes VxA(x) iff for all m, {n}(m) is defined and realizes A(m). 

For arbitrary formulas A we may define “n realizes A” as “n realizes the 
universal closure of A”. 

It is to be noted that if we read in the BHK-interpretation “realizing 
number” for “proof” and “partial recursive operation” for “construction”, 
the realizability interpretation appears as.a variant of the BHK-interpreta- 
tion. 

Instead of working with the semantical version given above, it is more 
advantageous to reformulate realizability as a syntactical mapping assigning 
a formula xrA of HA to each formula A of HA, where x € FV(A) of HA. 


4.2. DEFINITION. xfA is defined inductively by the following clauses, 
where x € FV(A). 


(i) xe[t=s]) =[t=s], 

(Gi) xK(A A B) = AxrA Aj,xrB, 

(ili) xr (A V B) = (jx =O Af) xtA) V (jx #FOA xB), 

(iv) xr(A > B):=Vy(yrA > du(Txyu A UurB) (u € FV(B)), 
(v) xtVyA = Vydu(Txyu A UurA), (u € FV(A)), 

(vi) xr dyA = fox t Al y/f,x]. 


In more informal notation with “Kleene brackets” (iv) and (v) may be 
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rendered as: 
xt(A > B) = Wy(yrd > E{x}(y) A {x}(y)rB), 
xtWyA = Wy(E{x}(y) A {x}(y)r A). 


REMARKS. 

(i) Clause (iii) is equivalent to xr(A V B) = (jx =0 > j,xKrA)A 
(j;x #0 > j,xrB). 

Gi) x rad @ Vy(yrA > {x}(y)r( = 1)), hence xran4 eo 
Vy(AyrA). Thus a negation is realized by some number iff it is realized by 
all numbers. 

(iil) xr4Ad © Vy(yraA) @ Aay(y tA) © AayW2(4z9rA) eo 
—742(zrA). O 


4.3. Characterization of realizability. We shall now sketch an axiomatic 
characterization of realizability. 

In doing so, we shall freely use the Kleene bracket notation, and E-logic 
on an informal level. A detailed formal treatment can be given in a 
definitional extension of HA with Kleene brackets and the existence predi- 
cate E, in which elementary recursion theory can be formalized (see 
sections 2.7 and 3.7). Some remarks on a more precise treatment are made 
in 4.13. 

In our characterization, an important role is played by the almost 
negative formulas. 


4.4. DEFINITION. A formula A of &(HA) is almost negative or essentially 
d-free iff A does not contain V, and J only immediately in front of prime 

formulas. 
In other words, A is almost negative iff it is constructed from prime 
formulas and existentially quantified prime formulas by means of A, > ,V. 
a) 


For an essentially 4-free formula 9 there is no non-trivial information on 
the realization of 3, V: @ is true iff realizable, and if @ is realizable then it 
can be realized by a canonical term depending on q only. This is the 
content of the following 


4.5. PROPOSITION. Let A(X) be essentially 3-free, FV(A) © x. Then there is 
aterm W, with Kleene brackets containing at most x free such that 

(i) HAF 4y(yrA) > A, 

@ii) HAF A> Ey, A,r. 
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Proor. ¥, is constructed by induction on the complexity of A. 


(a) ¥., =9, 

(b) Wayc-sy = J(min,[t = s], 0), 

(C) Vann =I(4 vp), 

(d) Yuva = Ay. Way) 

(eC) Wang =Aydy (y € FV(B)). 


In (d), (e) we used the notation A y.t for a term ¢ with Kleene brackets to 
indicate a canonically chosen index for the partial recursive function A y.t. 
For the proof one establishes (i) and (ii) simultaneously by induction on the 
complexity of A. We leave this as an exercise. O 


Because of the properties (i) and (ii) the almost negative formulas are 
sometimes called “self-realizing”. We note the following 


4.6. LEMMA. For all A: 
(i) x¥A is equivalent to an almost negative formula; 


Gi) (tr A)[x/s] = t[x/s]r A[x/s]. 


Proor. By induction on A. Consider e.g. the clause for implication: 
xr(B>C)eoVy(yrB > Az(Txyz A UzrC)) @ Vy(yrB > AzTxyz A 
Vu(Txyu > UurC)); and the latter formula is equivalent to an almost 
negative formula since by induction hypothesis yrB and UurC are 
equivalent to almost negative formulas, and 4z7xyz can be expressed as 
4z(x7(x, y, z) = 0), where x, is the characteristic function of T. O 


REMARK. From this lemma we can conclude that the properties (i) and (ii) 
of 4.5 characterize almost negative formulas up to logical equivalence. For 
this we have to show that ¢ = x, for a term ¢ with Kleene brackets, can be 
expressed in HA as J!yA(y), where A(y) is a conjunction of prime 
formulas in HA; hence ¢ = x can also be expressed as A! y(t, = ¢,), for 
suitable ¢,, ¢,. Then ~,rA A Ey, is logically equivalent to a formula 


3xdy(t, =2,) A Vyx(t, =t, > xr A). 
We leave the details as an exercise. O 


4.7. PROPOSITION (idempotency of realizability). 
HA + Ax(xray(yrA)) © dy(yrA). 
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PrRoor. <— : yrA is almost negative, so (proposition 4.5) yrdA- 


(Yoyeay (VFA) A Ediyray, hence yrd > (1% Yyray) Fav(y FA) A 
Ej(¥, Yiyray) and so Ax(xrdy(yrA)). > is even easier: if xrAy(yrA), 
we have j,.xr(j,x 1A), and by 4.5() j,xrA ie. dx(xrA). O 


4.8. DEFINITION. The extended Church’s thesis is the following schema, 
where A is almost negative: 
ECT,  Wx(Ax > AyBxy) ~ 3zVx(Ax > Ju(Tzxu A B(x, Uu))). O 


4.9. LEMMA. For any instance F of ECT) there is a term t such that 
HA?t Et A trF. 


Proor. Let F=Vx[A > 3yBy] - 3zVx[A > du(Tzxu A B(Uu))}. As- 
sume utVx[A — 3yBy]. We put 


t= {{u}(x)} (v4). 
Then 
Vx[A > Et A trayBy] 
and thus 
Vx[A > Et A jtrBly/fjt]). 


Take t, = Ax. j,t, t, = min,T(t,, x, v). Then Vx[A > j,tr Bl y/t,x]] and 
thus for 


i = j(t,, AxAw. j(t,, J(0, it) 
we find *r3zVx[A > Au(Tzxu A B(Uu))]. Therefore Au.t*rF. O 


Note that Aw. ¢* depends on A only! 


4.10. THEOREM (characterization of realizability). 
(ij) HA + ECT, A @ Ax(xrA), 
(ii) HA + ECT, + A @ HAF 4x(xrA). 


PROOF. (i) is readily proved by induction on the complexity of A, with help 
of lemma 4.6; we leave the details as an exercise. 

(i) <= follows from (i). = is proved by induction on the length of 
deductions in HA + ECT, (“= ” is the soundness theorem for realizability). 
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The proof is quite straightforward if we base ourselves, for example, on a 
Hilbert-type formalization of HA (such as the formalization obtained by 
taking H,-IQC of 2.4.1). 
As the basis of the induction, we have to indicate terms realizing the 
axioms. 
Logical axioms. 
Ax. j,x01(A A BA); Ax. j,xr(A ABB), 
AxAy.j(x, y)t(A > (B> AA B)), 
Ax. j(0,x)r(4 > AV B); Ax.j(1,x)r(B> AV B), 
AxA yAz.[(1 ~ sg(fz))({x}(z)) + se(az({y}(Az))] 
r((A > C) > [(B>C) > (AV B>C)), 
Au. j(t, u) (A(t) > 4xAx), 
Audv.{{{u}(jv)}(iv)) 1 (Vx(A > B) > (axd > B)) 
(x not free in B), 
AuAv.ur(A > (B->4)), 
AuAvdw.{{u}(w)}({o}()) 
r([4 > (B>C)] > [(A > B) > (4 >))), 
Au.{u}(t)r(WxAx > At), 
AuAvAx.{ {u}(x)}(v) r(Wx(B > A) > (B > VxA)) 


(x not free in B). 


Let us pause for a moment to check e.g. the case of (A > C) > [((B > C) 
—-(AVB->C)]. Suppose xr(A > C), yr(B > C), zr(A V B). Then 
if j,z=0, j,z1rA, so {x}(j,z)rC; and if j,z #0, then j,zrB, and 
{ y}(j.z)rC. It is now clear that, if ¢ is the term for this axiom schema 
indicated above, then {{{t}(x)}(y)}(z) always realizes C. We leave the 
verification of other cases to the reader. 


Non-logical axioms. Equations are all realized by 0; an implication between 
prime formulas such as x = y ~ Sx = Sy by Au.0 etc. So the only non- 
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trivial case is induction. Suppose wr(A(0) A Vx( A(x) > A(Sx))), then 
jyurA(0), 
jyutrVx( A(x) > A(Sx)). 


Using the fact that the partial recursive functions are closed under (primi- 
tive) recursion we define an index t(u) such that 


{t(u)}(0) = ju, 
{t(u)}(Sx) = {{ jou }(x)}({t(u)} (x). 
By induction on x one then shows {1(u)}(x)r A(x), so 
Au.t(u)r(A(0) A Vx( A(x) > A(Sx)) > WxA(x)). 
ECT), has already been treated in lemma 4.9. 


Rules. Suppose tr(A — B), t’r A. Then {t}(t’) is also defined and realizes 
B. If we have shown tr A(x), then clearly Ax.t rVxA(x). So the property of 
being realizable is transmitted from the premisses to the conclusion. O 


REMARK. (ii) can be refined to: for each sentence A such that HA + A 
there is a numeral 7 such that HA + nrA (E4.4.3). 


4.11. ECT) is stronger than CTy, ie. HA + CT) # F for some instance F 
of ECT, (cf. 10.3). On the other hand ECT, cannot be strengthened to 
ECT*, the schema obtained by dropping the condition “A almost negative” 
for ECT). In fact, even the following special case 


ECT*!  VWx[Ax > 3!yB(x, y)] 
> 42Vx[Ax > E{z}(x) A B(x, {z}(x))] 
conflicts with HA. To see this, take 
A(x) = 3uTxxu V “AuTxxu, 
B(x, y) = (AuTxxu A y = 0) V (-duTxxu A y = 1). 


We leave it to the reader to show that this particular instance of ECT*! is 
incompatible with HA (E4.4.4). 

Observe that if the premiss of ECT*! holds, then B describes the graph 
of a function with domain D = {x: Ax} C N. We cannot always assume 
such functions to be partial recursive. This shows that AC-DN! contradicts 
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the assumption that all partial functions from N to N are partial recursive 
(“partial Church’s thesis’’). Cf. also our remarks in 2.1. 


4.12. Realizability provides a systematic method for deciphering the con- 
structive meaning of logically compound formulas, but it does not give any 
information about the constructive meaning of 4-free statements. 

As an example, consider functions from N“ > N. By CTp, all functions 
given by a graph predicate A, i.e. a predicate such that Vxd!yA(x, y), are 
total recursive. Functions from N“ > N can be given by a graph predicate 
A, i.e. a binary predicate A(x, y) such that 


Vx € V,(4yA(x, y) A Vx’ EV (x =, xA A(x’ yy’) oy =y’) (1) 


where x € V, = WudvTxwo (i.e. x is the code of a total recursive functior 
and x =, x’ = Vy({x}(y) = {x’}())) (that is to say x and x’ represent 
extensionally the same function). 

Now Vx & V, 4yA(x, y) is of the form Vx(WuavTxuv > AyA(x, y)); by 
ECT, it follows that for some z Vx € V,(E{z}(x) A A(x, {z}(x))). More- 
over, by (1) z should also satisfy 


xEV,AX' EV, Ax =, x’ > {z}(x) = {z}(x’). 


Thus we know that under the realizability interpretation functionals in 
N"™ — N can be interpreted by partial recursive operations. 

Essentially the same method of constructively re-interpreting the mean- 
ing of logically complex statements has been proposed by Shanin as the 
basis for CRM (cf. Shanin 1958, 1958A, Kleene 1960). It turns out that 
almost the whole mathematical practice of CRM can be formalized in 
HA + ECT) + MRopr, where MRopp is the simplest form of Markov’s 
principle: 


MRpgp 2 = XAx ~ AxAx (A primitive recursive), 


a principle adopted by the Russian constructivist school. MR ppg will be 
discussed in the next section. 

On the other hand, 3x(x 1A) as we have defined it here, is not accepted 
as the fundamental interpretation of the constructive meaning of “A is 
true”, in CRM, for the following reason. Shanin (1958) regards the con- 
structive meaning of negative formulas as immediate. Nevertheless, realiz- 
ability transforms such formulas into more complex formulas (albeit logi- 
cally equivalent ones; but this can be established only after the semantics 
has been correctly explained): xrA is more complex than A, also for 
negative A. Therefore Shanin gives a method for “deciphering the construc- 
tive meaning of statements” in which J-free statements are not transformed 
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into more complex statements. Afterwards one can establish equivalence 
with realizability relative to HA. 

Shanin’s method is much more complex to describe than Kleene’s realiza- 
bility; and as follows from the equivalence of these interpretations relative 
to HA, the difference in semantic framework has no consequences for the 
practice of constructive mathematics. For these reasons we have restricted 
attention to realizability. 

ECT, justifies certain choice axioms such as AC, and DC discussed in 
section 2, but also the presentation axiom relative to HA: if we assume 
ECT), then any arithmetical {x: A(x)} is a surjective image of A’ = 
{(x, y): yr A(x)} and choice over the domain A’ is a consequence of 
ECT), since A’ is given by an almost negative predicate. 


4.13. The formal treatment of realizability. As noted above, though the 
realizability definition itself can be given entirely in HA, the treatment of 
the soundness and characterization results is made much easier if we 
assume Kleene brackets to be present in the arithmetical language, and 
accordingly we use a logic with existence predicate (E-logic or Et-logic, cf. 
section 2.2). Then “Ez” is a new prime formula, and realizability can now 
be specified by the following clauses 


G) trP := Et A P for P prime, 

Gi) tr(A A B) = (jt) A (jt), 

(iii) tr(A > B) = Et AVx(xrA > txrB), 
(iv) trVxA = Vx(tx r A), 

(v) tr4axA = jt A(j,0). 


Since V is definable we do not need a clause for V; if we wish to treat V 
as a primitive, we can add 


trAV B= (jt=OAZtrA)V ({tFOA/tB). 


We now show by induction on the complexity of A that (trA) > Eft is 
provable. The details of the formal treatment are similar to the treatment in 
the abstract setting of section 9.5. 


5. Markov’s principle 


5.1. In CRM, the following principle is accepted: 
MP Vx(Ax V 4Ax) A =A45xAx > AxAx 
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(Markov’s principle, also often denoted by ““M” in the literature), or in a 
form with function variables 


a74x(ax = 0) > ax(ax = 0). 


Of course, in the setting of CRM, a is an algorithmic (recursive) 
function, and A an algorithmically decidable predicate. Thus MP may be 
paraphrased as: when there is an algorithm for testing A, and by “indirect 
means” we know that we cannot avoid encountering x such that Ax, then 
in fact we can find x such that Ax: the testing algorithm provides the 
method for finding x. 

One is tempted to object to MP, that —-3x(ax = 0) gives us no effective 
method of actually producing an n © N such that an = 0; however, this 
objection does not help in the case one accepts a proof of —4x(ax = 0), 
together with an algorithm for a as a method for finding x (so to speak “by 
definition”). The acceptability of MP within the context of general con- 
structivism is therefore not all that easy to decide. 

N.B. In the presence of AC-NN!, the principle with function variables is 
equivalent to MP for predicates, but the two forms are not equivalent in the 
absence of any form of function comprehension. Thus for the lawless 
sequences of 6.2(a), discussed at length in chapter 12, the functional form of 
MP is refutable, while MP for predicates not containing choice parameters 
free is consistent with the theory LS of lawless sequences. 

Since the general schema of the BHK-interpretation still leaves us a lot of 
freedom, one might seek a solution in the form of a modification of the 
BHK-interpretation which so to speak makes MP automatically true. In 
fact, as shown by Gédel (1958) it is possible to give an interpretation of 
logic in the context of Y(HA) which makes MP true without having to 
assume it beforehand. That is to say, for a suitable translation ° we have 


HA + CT, + MP + IP, bk A = HAE A?, 
where AP = A for II?-formulas, and IP) is the schema 
IP,  Wx(A V AA) A (Wxd — 3yB) > 3y(VxA > B) 


(y € FV(A)). The translation ? may be extended to other languages and 
stronger systems; cf. 10.4. 


5.2. In the presence of CT, MP is equivalent to its special case 


MPpp 9 ad xA(x) > 3xA(x) (A primitive recursive); 
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in fact this schema is easily seen to be equivalent to a single axiom 
Vxy(—32zTxyz > 3zTxyx) 

(F4.5.2). 


5.3. Markov’s principle and Post’s theorem. Post’s theorem, well-known 
from recursion theory, states that if a set and its complement are both 
recursively enumerable, then the set is recursive. The proof of this fact uses 
MPppr: if Zo, 2; are codes for enumerating X and its complement, then 
Vxi74 (Tz xy V Tz,xy) hence, by MPpp, Vxdy(Tzoxy V Tz,xy), and 
this gives in turn 


Wxalz[(z =0 A FyT (2p, x, y)) V (Zz = 1A AyT (21, x, ¥))] 


from which one readily obtains a characteristic function for X. An abstract 
version of Post’s theorem (for the case of an inhabited set with inhabited 
complement) in which the recursiveness has been dropped is 


PT xy( Bx # yy) A Wx—4y(By = x V yy = x) 
> Wxdy((By = x) Vv (yy =x)). 


PT is in fact equivalent to MP (exercise). 
An interesting mathematical consequence of MP is given by the following 


5.4. PROPOSITION. MP is equivalent to Vx € R(x #0 > x #0) where R is 
the set of reals as defined in 1.2.3, namely as the set of equivalence classes of 
fundamental sequences, and x # 0 means 3k(\x| > 27*). 


ProoF. The detailed treatment of the theory of the real numbers is reserved 
for the next chapter, but that need not bother us here. Let the fundamental 
sequence (r,,), determine the real x; without loss of generality we may 
assume Wn(|r, — x| < 27"). If x = 0, this means Vn(|r,| < 27”), and there- 
fore x # 0 is equivalent to 4Wn(|r,| < 2~”) and hence to —44n({r,,| = 27”). 
With MP we get |7,|2= 27” for some n, and since |r, — x|< 27”, Le. 
Ir, — x|< 27" — 27” for some m, it follows that |x| > 27”. Conversely, 
suppose —44x(ax = 0). We construct (7), as 


r= 


0 if Vm < n(am # 0), 
2°" ifm<n,am =0,Vm' < m(am’ # 0); 
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then (r,), is a fundamental sequence determining a real number x, and 
x #0. Assuming Vx € R(x + 0 > x#0) we then also have x#0, ice. for 
some k |x| > 27*. Then clearly Jm < k(am = 0) (otherwise |x| < 27*). 

oO 


5.5. Realizability in the presence of MP. Assuming MPpp, the notions of 
“almost negative formula” and “negative formula” coincide modulo logical 
equivalence, since by MPpp 3x(t = 5) @ AVx—-(t = 5). This fact permits a 
characterization of realizability by a variant of ECT, which is in certain 
respects to be preferred. 


PROPOSITION. Assuming MPpp, ECT, is equivalent to the schema 
ECT,  Vx[Ax > JyB(x, y)] 
> 4zVx([5Ax > du(Tzxu A B(x, Uu))]. 


Proor. Exercise. O 


ECT) is more convenient to handle than ECT) since we do not need 
syntactical restrictions on A(x). This advantage becomes particularly clear 
when one wishes to extend the characterization of realizability to second- 
order arithmetic. Especially in combination with MP (or MPpa) the schema 
ECT, (or ECT) yields interesting mathematical consequences not obtain- 
able by CT, alone. Among those consequences we mention in particular the 


THEOREM. Assuming ECT, + MP, all mappings from complete separable 
metric spaces into metric spaces are continuous. 


PROOF. See 6.4.12 for the case of mappings from R into R, and 7.2.11 for 
the general case. O 


6. Choice sequences and continuity axioms 


6.1. In 1917 L.E.J. Brouwer introduced choice sequences in intuitionistic 
mathematics, in order to capture more satisfactorily the intuition of the 
continuum. . 

Choice sequences have given traditional intuitionistic mathematics its 
characteristic flavour. The idea of a choice sequence is that of a sequence as 
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a process, not necessarily predetermined by some law or algorithm, succes- 
sively producing elements of a sequence. The possibility of using choice 
sequences mathematically is given by the fact that in practice all operations 
mapping sequences to other sequences or to natural numbers are continu- 
ous, that is to say can be carried out by operating on (computing from) 
initial segments. 

In the introduction (1.3.6) we observed that we were unable to give an 
example of a total discontinuous function from R to R, which suggests that, 
constructively, all total functions from R to R are continuous. The absence 
of an example of a total discontinuous function is not a proof of the 
conjecture however. 

Permitting fundamental sequences to be given by choice sequences results 
in a wider notion of real number, for which the statement “all total 
functions from R to R are continuous” can be proved by means of 
plausible axioms for choice sequences (cf. theorem 6.3.4). This should not 
greatly surprise the reader, since the condition “f is total on R” becomes 
much stronger if we extend R by permitting also reals defined via funda- 
mental sequences given by choice sequences. In other words, the meaning of 
“f is total on R”, when permitting choice sequences with intuitionistic logic 
in the definition of reals, is very different from its classical meaning. 


6.2. Varieties of choice sequences. In this section we are, deliberately, not 
very specific about the way one should think about choice sequences. We 
illustrate the ample possibilities and pitfalls by means of two special 
examples. 

(a) Lawless sequences. A lawless sequence is to be viewed as a process 
(say a) of choosing values a0, al, a2,... € N such that at any stage in the 
generation of a we know only finitely many values of a. If we generate a 
lawless sequence, then we have a priori decided not to make general 
restrictions on future values at any stage. Thus at each stage we can only 
choose finitely many new values (or exactly one) of the sequence; we 
cannot, for example, decide that all values to be chosen after stage ¢ are to 
be even. 

A good “picture” (intuitive model) of a lawless sequence is that of the 
sequence of casts of an (abstract) die, permitting perhaps finitely many 
deliberate placings of the die in the beginning. We can throw the die as 
often as we like, but at any stage we only know the result of finitely many 
casts. 

The very idea of a lawless sequence implies that we can never assert that 
its value will coincide with the values of some lawlike sequence, since this 
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would result in an overall restriction. Let a be a variable for lawlike 
sequences. In order to be able to assert Ja(a = a), we must know at some 
stage that a = a for some lawlike a, which is clearly impossible. Thus we 
have 


Va—Ja(a =a). 


One should contrast this with the behaviour of the 

(b) Hesitant sequences. The name “hesitant sequences” is ad hoc, and the 
notion is discussed here only for purposes of illustration. A hesitant 
sequence (say 8) is a process of generating values BO, B1, 82,... © N such 
that at any stage we either decide that henceforth we are going to conform 
to a law in determining future values, or, if we have not already decided to 
conform to a law at an earlier stage, we freely choose a new value of . 
Thus at any stage in the construction of 8 we either know only finitely 
many values of 8, or we know that all future values of 8 will be chosen 
according to a law. 

Therefore the lawlike sequences are in some sense contained in the 
hesitant sequences: they correspond to those hesitant sequences for which 
already at stage 0 one decides to conform to a law. Also, for a ranging over 
hesitant sequences 


Va-da(a — a). 


For to have a proof of =Ja(8 = a) for some hesitant 8 would mean that at 
some stage we would be sure that 8 would never conform to a law; but the 
very idea of a hesitant sequence prevents this, since we always may decide 
later on to become law-abiding, and therefore .Ja(B = a). 

On the other hand, we cannot expect Vada(a = a) to hold for hesitant 
sequences, since we have no method for assigning a law to each hesitant a; 
the decision whether or not to conform to a law may stay open indefinitely 
(hence the name “hesitant’). 

As shown by the conflicting properties -—Ja(8 = a), Ja(a = a), the 
universes of lawless and hesitant sequences are disjoint, notwithstanding the 
fact that at first sight hesitant sequences may seem to be very similar to 
lawless ones. 

In what follows we shall assume choice sequences to embrace all possibil- 
ities from lawless to lawlike. The lawless sequences will be treated in 
considerable detail in Chapter 12. 


6.3. Weak or local continuity WC-N. Suppose we are given some func- 
tional  € NN > N defined for a// choice sequences. Then the fact that ® 
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is total, i.e. everywhere defined, and that for an arbitrary choice sequence 
we cannot, in general, assume more than an initial segment to be known at 
any given moment leads to the supposition that ® must be continuous on 
NN, that is to say 


ValAxVB € ax(B = Ga). 
Combining this with a choice principle 

VadxA(a, x) > Id ENN SN VaA(a, Ga) 
one is led to the schema of weak or local continuity for numbers 
WC-N — Wa4xA(a, x) > VadxdyVB € ax A(B, y). 


The term “‘local” refers to the fact that the schema only states the existence 
of a suitable y for a whole neighbourhood of a for each a individually. Via 
the equivalence (A V B) @ Ax[(x = 0 A A) V (x #0 A B)] we obtain a 
version of WC-N for disjunctions: 


WC-NY  Wa( Aa V Ba) > Wadx(VB € ax A(B) V VB © ax B(B)). 
Weak continuity conflicts with classical logic, in much the same way as in 
the case of CT), as the following two propositions show. 


6.4. PROPOSITION. WC-N Y refutes V-PEM. 


PROOF. Assume V-PEM, i.e. 
Va(Vx(ax = 0) V AVx(ax = 0)), 
then by WC-N Y 
Vady(WB € ayVx( Bx = 0) V VB € ayVx(Bx = 0)). 
Now specialize a to Ax.0, then for some y, with n = (Ax.0)(y) 
VB € nVx( Bx = 0) V VB € n=Vx( Bx = 0). 
The first disjunct is false, as may be seen by taking any B € n * (1); and the 


second disjunct is also false, as follows by choosing B = a =Ax.0. O 


REMARK. It is worth noting that this proof explicitly uses the fact that 
certain lawlike sequences (i.e. Ax.0) are part of the universe of choice 
sequences. In the presence of FAN* (7.3) or bar-continuity (8.10) -4-PEM 
can be proved without this assumption (cf. E4.7.6, E4.8.12). 
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COROLLARY. The logical principles PEM and Va—A > ——VaA are not 
generally valid. 


Proor. Cf. 3.4. O 
6.5. PROPOSITION. WC-N Y proves —=SEP. 


ProoF. Let us assume SEP, ie. 
VaB(A(4x(ax = 0) A dx( Bx = 0)) 
> Adx(ax = 0) V 43x( Bx = 0)). 
Let y be arbitrary; we define y’, y” from y by 


witli fs if y(2x) =0 A Wy < 2x(yy # 0), 
1 otherwise, 


sits (° if y(2x +1) =OA Wy < 2x + (yy #0), 
1 otherwise. 


Clearly 4(3x(y’x = 0) A 4x(y”x = 0)), for all y, hence by SEP 
Vy[=4y(y’y = 0) v -ay(yy = 0)] 

and thus with WC-N Y we find for any y a y such that 
V8 € ¥y—dy(6’y = 0) V VS € Yy3y(8”y = 0). 


Apply this to y = Ax.1. Then the first part of the disjunction becomes false 
for any 6 & (Ax.1)(2y)* (0) and the second half of the disjunction becomes 
false for 6 € (Ax.1)2y + 1)*# 0), and so we have arrived at a contradic- 
tion, i.e. “SEP. O 


6.6. We mention some interesting mathematical consequences of WC-N, 
such as 

(i) If {X,:i€N} covers a complete separable metric space, then 
{Int(X,): i € N} is again a cover (cf. 6.3.3, 7.2.6). 

(ii) As a corollary to the preceding property one obtains: each function 
from a complete separable metric space to a separable metric space is 
continuous. In particular functions from R to R are continuous (6.3.4, 
7.2.7), 

(iii) WC-N ‘can also be used to prove certain classically valid results 
which, however, are usually proved by constructively invalid reasoning. For 
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example, all sequentially continuous functions from a separable metric 
space to a metric space are continuous (cf. 7.2.8). Here “sequentially 
continuous” means that convergent sequences are mapped to convergent 
sequences, On the other hand we have 


6.7. PROPOSITION. WC-N refutes CT. 


Proor. Apply WC-N to VadxVy4z(Txyz A ay = Uz). O 


Instead of WC-N one may also consider the more cautious principle of 
weak continuity with uniqueness 


WC-N! — Wad!xA(a, x) > VadxdyVB € ayA(B,x). 


This corresponds to assuming continuity only for those © € N* > N which 
are already given to us as extensional, ic. a = B > ®a = OB. (Here and 
elsewhere in this chapter, the predicates considered are assumed to be 
extensional in their choice parameters.) WC-N! does not obviously conflict 
with CT; in fact, as follows from E6.4.9 and theorem 7.2.7, WC-N! is 
derivable from ECT, + MP. 

We now turn to a strengthening of weak continuity. 


6.8. Neighbourhood functions and weak continuity. WC-N does not exhaust 
our intuitions about continuity. For if ® €@ NN > N is any continuous 
functional on N%, it seems plausible to assume that we actually know 
whether an initial segment of a suffices to compute ®a. That is to say, we 
can actually associate with ® a “neighbourhood function” g € N > N such 
that 


(an) = 0 = an is not sufficiently long to compute ®a from it; 


y(an) = m+ 1 @ an is sufficiently long to compute Ga from it, 
and ®a = m. 


What is more, the natural way of presenting a continuous functional 
® < NX —N seems to be via such a neighbourhood function. Clearly, the 
neighbourhood function satisfies (i) Vadx(p(ax) # 0), and (ii) Vam(gn # 0 
— gn = y(n*m)). (i) is a consequence of © being everywhere defined, 
and (ii) is due to the fact that the value computed should not change if 
more information about the argument becomes available. Conversely, any 
satisfying (i) and (ii) defines a continuous ® € NN >N by 


®a =n = 4 m(g(am) =n+4+1). 
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Thus, if we define Ko, the class of neighbourhood functions by _ 

y € Ky = Wadin(y(an) #0) A Vam(yn #0 > yn = y(n*m)), 
and define 

y(a) = n:=Am(y(am) =n +1), 


(the continuous functional given by y € Ky assigns n to a), we can now 
express a global form of continuity by 


C-N WadxA(a, x) > Jy € KWaA(a, y(«)) 


((strong) continuity for numbers, sometimes also denoted by CONT)). 


REMARK. “an is sufficiently long for computing ®a” should not be under- 
stood as merely saying that ® is constant on V,,,, but rather that the method 
for computing ©, by which © is constructively given to us, requires an for 
its execution. 

The following example makes it clear that the first interpretation is to be 


excluded. Specify ®, depending on 8 by 


teva if Vy < a0(By = 0), 
Be if dy < a0( By = 0). 


Clearly the method for computing ®,(a) requires knowledge of a0 only, ie. 
of a1. However, if —3y(By = 0), ®, is constant on Vz) = Vi). So if we 
could decide whether ® is constant on Y ) or not, we could also decide 
whether 4y( By = 0), for arbitrary B, i.e. J-PEM would follow. 

As an illustration we give in fig. 4.1 the picture of a neighbourhood 
function y for the functional ®: a > .a(a0). 


6.9. CT), WC-N, C-N and their impact on logic. If we think of the 
constructive mathematical universe as consisting of lawlike mathematical 
objects, and if we also accept the identification of lawlike with recursive for 
number theoretic functions as constructively justified, we have no reason to 
doubt the compatibility of intuitionistic predicate logic with CT). 

Similarly, as soon as we have decided that choice sequences are legitimate 
objects in intuitionistic mathematics, and constitute a domain over which 
one can quantify, we need not doubt the validity of intuitionistic predicate 
logic over this domain. The continuity axioms WC-N and C-N express 
properties of the quantifier combination Vain not already implicit in the 
BHK-explanation, which are seen to hold by reflection on the way (data 
conceming) choice sequences are given. 
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If on the other hand, we do not regard CTy as justified by our intuitive 
understanding of constructive mathematics, the adoption of CT) amounts 
to a strengthening of Vnidm as given by the BHK-interpretation. This 
requires a rethinking of logic: we have to convince ourselves that the laws 
of IQC remain valid on this stronger reading of “Wn4dm”. As we have seen 
in the preceding section, this can be done by taking everywhere in the 
BHK-interpretation the operations on proofs to be (partial) recursive with 
respect to numerical information contained in constructive proofs; this 
leads in a natural way to realizability. 

Similarly, if we doubt the coherence of the notion of choice sequence, or 
the analysis which led to WC-N and C-N, then we have a comparable 
situation in the case of the adoption of WC-N or C-N; and independent 
confirmation of the applicability of the laws of IQC becomes desirable. 
(This matter will be discussed in greater detail in chapter 12.) 

In the following subsections we shall discuss some logical relationships 
between various forms of the continuity axioms, and one further 
strengthening of continuity, namely continuity for quantifier combinations 
Vad. One easily proves the following 


6.10. PROPOSITION. 
(i) AC-NN follows from C-N. 
(ii) C-N is equivalent to the following version 


C-N’ VadnA(a,n) > Jy € KWa(yn # 0 > Wa €nA(a, yn ~ 1)). 
PROOF. (i) Assume C-N, Wx4yA(x, y); then also VadyA(a0, y), so for 
some y © Ky VaA(a0, y(a)), hence specializing to a = Az.x 
VxA(x, y(Az.x)), and thus with B = Ax.y(Az.x) we find A(x, Bx). 

(i) Assume C-N’, VadnA(a, n); let a be arbitrary. By C-N’ there is a 


y © Ky such that if yn # 0, then Va € n A(a, yn — 1); suppose y(ax) # 0, 
then y(a) = y(@x) ~ 1; and since a € ax, we find A(a, y(a)) etcetera. O 


6.11. Another convenient form of C-N is 
C-N” VWadnA(a,n) > Fy € KWn(yn #0 > AxVa € nA(a, x)), 
which readily can be generalized to the following principle 
C-D Vadd € DA(a,d) > 
dy © KWn(yn # 0 > Ad € DVa En A(a, d)), 
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where D is any domain of lawlike, completely determined objects. We have 
the following 


PROPOSITION, C-N” + AC-NN is equivalent to C-N and to C-N’. 


Proor. Assume AC-NN, C-N” and let VadnA(a, n); then for some y € Ko 
Vn(yn #,0 > ImVB € nA(B, m)), (1) 
which by the decidability of yn + 0 yields 
Vnim(yn # 0 > VB EnA(B, m)) 


(choose m as given by (1) if yn # 0, 0 otherwise). With AC-NN there is a 6 
such that 


Vn(yn #0 > VB E€nA(B, 6n)). 
If we put 
2 0 if yn = 0, 
aor eae ifm<nA ym #0 A Vm’ < m(ym’ =0), 
then 
Vn(y'n #0 > Va EnA(a,y'n~1)). O 


6.12. The neighbourhood function principle. The following principle con- 
nects C-N and WC-N 


NFP — WaaxA(ax) > Jy © KWaA(&(y(a))), 


as becomes clear from the following 
PROPOSITION. C-N is deducible from NFP + WC-N. 
Proor. Exercise. O 


In earlier publications (e.g. Kreisel and Troelstra 1970) NFP was called 
SC-N (“Special Continuity for Numbers”), but we have dropped this 
designation, since it is not very informative and, possibly, even misleading. 
Formally NFP is a special case of C-N. However, it is not a typically 
intuitionistic continuity principle since it is classically valid. Intuitively, 
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NFP says that, when A is a property of initial segments of sequences, then 
initial segments of sequences satisfying A can be found by a neighbourhood 
function. 


6.13. Generalization to arbitrary spreads. If T is any spread, we easily 
deduce the following generalizations of C-N and WC-N respectively: 


C-N(T) Va,dxA(a,x) > Jy € KWa,A(a, y(a)), 

WC-N(T) — Va,3xA(a, x) > VardxdyVp; € ax A(B, y). 

To see this, let [ be a mapping of N™ onto T such that Ta = a for a € T 
(cf. lemma 1.4), then from Va,4xA(a, x) we first conclude VaaxA(Ia, x), 
hence by C-N also Jy € KWaA(Ta, y(a)) and therefore Jy € Ky 


Va,A(a, y(a)), etc. Of course, one can also formulate generalizations 
C-N’(T), C-N’”(T) similar to C-N’ and C-N” in 6.10, 6.11 above. 


6.14. Mathematical consequences of C-N. From C-N (in fact, already from 
NFP) one obtains an intuitionistic version of Lindeléf’s theorem: every 
open cover of a complete separable metric space has an enumerable 
subcover (cf. E4.6.5, E6.3.7 and theorem 7.2.10). Another example is the 
theorem: each f € R — R is continuous and has a continuous modulus of 
continuity (see E6.3.8-9). 


6.15. Function continuity. A considerably stronger form of continuity oc- 
curring in several axiomatic systems for intuitionistic analysis is function 
continuity 


VadBA(a, 8B) > 30 € CONT VaA(a, Ta), 


where CONT is the class of continuous functionals from N% to N%. 
We can express this in terms of neighbourhood functions as follows. Let 


yla = B = Wn(¥,)(a) = Bn), 
where 

¥, = Am.y(k *m), 
or more explicitly 

yla = B= Vx((yla)(x) = Bx) = ¥x(Any((x)*n)(a) = Bx). 
Since 


y € Kyo ax(y=Ay.xt+ 1) V (¥¢ Y=O0A Vn (Yn) € Ky)), 
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any y © Ky may serve as a code for a sequence of continuous functionals 
of type N“ +N, precisely what is needed to code a single continuous 
functional of type NN > NN". Function continuity may now be formulated 
as 


C-C  VWa3BA(a, 8B) > Jy € Ky WaA(a, ya). 


C-C is also denoted by CONT,. No interesting mathematical applications 
of C-C are known, though it is of foundational interest. We note the easy 


PROPOSITION. C-C implies C-N. O 


7. The fan theorem 


7.1. By the introduction of choice sequences, Brouwer not only hoped to 
capture the intuition of the continuum, but also to save a core of traditional 
analysis. In particular, he wanted to preserve the theorem that a continuous 
function from a segment (= closed interval) of finite length to R is 
uniformly continuous. 

Brouwer obtained this result as a corollary of his “bar theorem”, to be 
discussed in the next section. We prefer to discuss first, separately, the most 
important corollary to the bar theorem, the so-called fan theorem. This 
corollary suffices for the theorem on continuous functions just mentioned; 
in fact, all the known mathematically relevant applications of the bar 
theorem are actually consequences of the fan theorem. 

We shall first discuss some forms in which the fan theorem appears as an 
axiom, and show that it conflicts with CT; at the end of the section we 
briefly discuss the intuitive motivation for accepting the “fan theorem” as 
an axiom. 


7.2. The principle FAN). In its simplest form, the “fan axiom” applies to 
finitely branching spreads, i.e. fans; it says that if T is a fan, and for each 
infinite branch a in T there is an initial segment ax satisfying the decidable 
property A, then there is a uniform upper bound to the x involved. For a 
given fan T this can be expressed as 


FAN,(T) = -Wn7-(An V GAn) A WazaxA(ax) > 3zVazdy < zA(ay). 
The general form is 
FAN) fan(T) > FAN,(T), 


where fan(7’) formally expresses that the set T is a fan. 
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Intuitively, FAN,(7) expresses a restricted form of compactness of T 
(with the topology generated by the basis {V,:n € T}, where as before 
V, = {a © T: a © n}), namely that any decidable set of basis elements 
covering 7 has a finite subcover. (The decidable set of basis elements is 
{V,,: A(n) An © T}, the finite subcover is found by taking {V,: A(n) A 
n€TA lth(n) < z} with z as given by FAN/)(T).) 

We may think of FAN) as an intuitionistic version of K6nig’s lemma. 
Classically, K6nig’s lemma and FAN) are equivalent. For let T be any fan, 
and let us simply write a, n for a;, n; etc. Assume VadxA(ax), then 


T, = {m:Vm' < m—Am'} 


is a tree with all branches finite. Now suppose that 44zVady < zA(ay), 
ie. VzdaVy < z4A(ay) by classical logic; this means that T, has arbi- 
trarily long finite branches, and hence by K6nig’s lemma there is an infinite 
branch in T, i.e. JaVz—A(az), which contradicts VadzA(az). We leave the 
proof of the converse (i.e. FAN,(7) implies Kénig’s lemma by classical 
logic) as an exercise. 


7.3. The principle FAN. Full compactness of the fan T can be expressed by 
the following strengthening of FAN)(T) 
FAN(T) = Wa,axA(ax) > 4zVa;dy < zA(ay). 
To see this, let {W,: i © I} be any open cover of T. Then 
Va,dxdi € I(V;, C W,). 
By FAN(T) we find 
3zWa,dy < 239i € I Vy SC W,), 
hence a fortiori 
4zVa,di € I(Vz, Cc W,). 


There are only finitely many initial segments of length z in T, say 
No,---, N,3 choose for each n,a W, such that Vs, Cc W, then {W,, sok W,} 
is the required finitely indexed subcover. It is easy to see that conversely 
compactness of T immediately yields FAN(T). 

The general form is 


FAN _ fan(T) > FAN(T). 


All the more important mathematical consequences of the fan axiom can 
be obtained from the version FAN. FAN is still classically valid, though 
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this is not any longer so for the following generalization (exercise): 
fan(T) A Wa;dxA(a, x) > 3zWa,dy < zA(a, y) 
and a fortiori not for the combination of FAN with continuity: 
FAN*  fan(T) A Wa,;axA(a, x) > JzVa74IxVB € azA(B, x). 


We note the following 
7.4. PROPOSITION. 
(i) FAN(T) follows from FAN)(T) + NFP (cf. 6.12). 


(ii) Assuming C-N, the principles FAN)(T), FAN(T), FAN*(T) are all 
equivalent; FAN*(T) is equivalent to FAN)(T) + C-N(T). 


PRoor. We show (i); (ii) is proved similarly. Suppose Va;4xA(ax). With 
NFP we find a y € Ky such that 


VarA(@(y(a))). 
Now consider 
A*(n) = yn #0 A Ith(n) > yn = 1. 


Obviously A* is decidable, and Va7dxA*(ax) holds (using the two essential 
properties of a neighbourhood function). By FAN)(T) we find 


3zVa;dy < zA*(ay), 
that is to say 

azVa,ay < z(y(ay) #OAy2 (ay) + 1). 
So with (1) 

azVapdy < z4y’ < yA(ay’), 


from which the required conclusion immediately follows. O 


7.5. Reduction of FANp, FAN to the case of the binary tree. So far we 
stated our schemata for arbitrary finitely branching trees. But in fact the 
case of the binary tree 7), consisting of all 01-sequences is paradigmatic, as 
shown by the following 


PROPOSITION. FANp, FAN are derivable from FAN)(T>,), FAN(Tp,) re- 
spectively, 
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ProoF. The proof is based on the following two facts (a) and (b). 

(a) If T’ is a subfan of the fan 7, there is a continuous mapping I from T 
onto T’ such that Ta = a for a € T’; in fact there is a I’ € N > N such 
that I’(ax) = (Ta)x (we already indicated such a T in 1.4). 

(b) If T is any fan, there exists a homeomorphism ® from T onto a 
subfan of the binary fan. Given T and a € T, we can define ®a as the 
sequence 


o+1 1+1 2+1 3+1 
Lee Oe On Os On. 


where 1” stands for a sequence of n times 1. If Bn == max{an:n € T}, 
then £8 is well-defined because T is a fan. ax obviously determines 
(®a)(x), and (~*w)(x) can be determined from a(x) where 


+2x+1. 


o(x)=[¥ 8 


i=0 


Just as in the proof of C-N(T) from C-N (cf. 6.13), we obtain from (a) that 
FAN,(T) implies FAN,(T’) etcetera. 

From (b) we obtain that FANp(T) follows from FAN)(7") for some 
subfan T’ of J, (and similarly for FAN, FAN*). This may be seen as 
follows. Let Va;IxA(ax), and Wn,;(An V An), and let us write % for 
@~'. Then Va,-4xA((¥a)x), and therefore also 


Vapayax| A((a)x) A px < Itha(y))]. 


Since the predicate Jx{ A((Wa)x) A px < Ith(n)] is a decidable predicate of 
n for n & T’, we can apply FAN)(T’) and find a z such that 


Vapdy < z4x|A((¥a)x) A px sy]. 
Since x < ox for all x, also Va;dx < zA(ax). O 
REMARK. The proposition also applies, mutatis mutandis, to FAN*. 


7.6. As we already pointed out, ECT, + MP implies continuity of map- 
pings from N™% to N. On the other hand, FAN, is compatible with C-N' but 
conflicts with CT. This is a corollary of the following theorem due to 
Kleene (1952A; see also Kleene and Vesley 1965, lemma 9.8). 


THEOREM. Let a range over recursive 01-sequences, n,m over finite O1-se- 
quences. There is a recursive (in fact even primitive recursive) predicate R 
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such that Vnm(R(n*m) > Rn) and 
VaIxiR(ax), (1) 
Vxdn(Ith(7) = x A Rn). (2) 


Proof. Let T be Kleene’s T-predicate and take for R 
R(ax) = Wy < x[(au < x(Tyyu A Uu = 0) > ay = 1) 
A (du < x(Tyyu A Uu = 1) > ay = 0)]. 


Then (2) is obviously satisfied: given x, take the recursive 01-sequence a 
such that 


(ay =1) eo (y <x A du < x(Tyyu A Uu =0)). 


As to (1), let a = {y} and suppose Tyyu. Now assume R(a(u + y + 1)). 
There are two possibilities, Uu=1 or Uu=0. If Uu =0, then du < 
u+y+ 1(Tyyv A Uv = 0). Hence ay = 1, so { y}(y) = 1; contradiction. 
Similarly Uu = 1 yields a contradiction. Therefore ,R(a(u+y+1)). O 


COROLLARY. FANg is incompatible with CT. 


PROOF. {n: Vm < nRm} is a recursive tree well-founded w.r.t. all recursive 
01-sequences; but by (2) there is no upper bound to the length of the 
branches in the tree. O 


REMARK. Any a@ such that VxR(ax) separates {y:3y(Tyyu A Uu = 0)} 
and { y:du(Tyyu A Uu = 1)}, our standard example of a pair of disjoint, 
recursively inseparable sets. A more simple-minded definition of a recur- 
sively well-founded, not well-founded tree would be R*(ax):= Vy < x 
(ay =1 © JAu(Tyyu A Uu = 0)), but R* is not decidable. 


7.7. On the intuitive motivation of FAN p and FAN. We discuss FAN) (7 )) 
and FAN(7p,) for the binary tree 7), of 01-sequences; as we have seen in 
7.5, this involves no loss of generality. So let n,m range over finite 
01-sequences and a, B over infinite ones. For any decidable predicate A 
such that VadxA(ax), A* = {n:Wm < n—Am} is well-founded. That is 
to say, if we follow any path through the binary tree (follow an arbitrary 
01-choice sequence) we always meet the bar P = {n: An A Vm < n—Am}. 
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As an illustration, we have drawn the picture of the bar associated with 
A(n) = Ith(n) > 20+: (the black nodes represent the bar). 


We can build a class 7 of well-founded binary trees using the following 
two principles 


{( )} EF, (1) 
if 7), T, belong to 7 then so does 
2(T), Ty) = ((O)enin Ty} U (anime T}U{()}. (2) 


The fact that a tree is to be put in J on account of (1) and (2) only (in 
other words, 7 is the least class satisfying (i) and (ii)) can be expressed by 
the following principle 


{{ )} ELA WX X,(% ELA X, EH 3X, X) EF) 
>(F7c#) (3) 


where X,, X; range over sets of finite sequences. 

It is easy to see that each element of 7 has finite depth (take for X € ¥ 
in (3): X is a binary tree of finite depth), i.e. there is a maximum length for 
the branches, for any tree from .7; and conversely, a binary tree of finite 
depth obviously belongs to 7. The class 7 represents an inductive notion 
of well-foundedness; let us simply call the trees in Z inductive binary 
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trees. FAN) identifies ordinary well-foundedness and inductive well- 
foundedness for binary trees. 

Indeed, it is hard to see how one can establish that a tree is well-founded 
with respect to arbitrary paths (i.e. paths about which nothing specific is 
assumed) without generating the tree inductively. The conflict of FAN and 
CT shows, on the other hand, that it is essential that we think of the paths 
as “arbitrary”: about recursive paths we have too much information. 

In other words, FAN, strengthens the interpretation of VaaxA(ax) for 
decidable A to “A* is inductively generated”; the latter is a purely arithmet- 
ical notion, since 7 is an r.e. set (recall that finite trees can be coded by 
natural numbers). If we strengthen the interpretation of this particular 
combination of quantifiers, it is not quite obvious that all the rules of 
intuitionistic predicate logic still apply (see also the discussion in 6.9). A 
verification that Va4x can actually be strengthened as indicated by FAN, 
while retaining the laws of intuitionistic predicate logic is provided by the 
method of elimination of choice sequences, a special case of which is being 
discussed in section 12.3. 

FAN states a stronger assumption: if Va4dxA(@, x), then the tree A* = 
{n:Wm < n—Am} is contained in an inductive binary tree. It seems most 
natural to motivate both FAN and FAN* by combining C-N and FAN). 


8. Bar induction and generalized inductive definitions 


Knowledge of this section is not necessary for understanding the chapters 
5—8 on intuitionistic mathematics. However, bar induction has played a 
very important role in Brouwer’s development of intuitionistic mathematics 
and occupies a prominent place in the literature, and the intuitionistic 
motivation for accepting the fan theorem is best understood by studying the 
reasons for accepting bar induction. 

In the preparation of our discussion of bar induction, we first introduce 
the inductively generated neighbourhood functions and trees, important 
special examples of generalized inductive definitions. After that, we turn to 
bar induction; and finally we return briefly to the-subject of generalized 
inductive definitions. The latter part may be skipped by a reader who is 
only interested in bar induction and its connection with the fan theorem. © 


8.1. Let T be any tree; then we can define 
T™:= {n:VWm<n(meT)}. 
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This tree is regular, i.e. it has the property 
(x is a terminal node) V Vx(n*(x) € T7). 
Conversely, each regular tree T’ determines a T such that T~ = T’ by 
nET=HnteOeT. 
A tree T is said to be well-founded iff each sequence “leaves the tree”, i.e. 
Vadx—(ax € T). Clearly T~ is well-founded iff T is well-founded. T~ \T 
consists of all terminal nodes of T~ , if T is well-founded; T~ \T is a bar 
of T, i.e. for each a we can find an x with ax € T~ \T, Wn < ax(n & T). 
Let Ky be defined as in 6.8. With every y € K, there is naturally 
associated a well-founded tree 
TY:= {n: yn =0}, 
a bar 
BY = (T’)7\TY, 
and a function 6 given by 
6n = yn — 1. 
Conversely, y can be recovered from BY and 6 by 
yn = 0if Im € BY(n<~m), 
yn =8m+1ifme BYAmsn., 
Let Kj C Ky consist of the neighbourhood functions uniformly bounded 
by 1, ie. 
Kg = {An.sg(yn): y € Ko}, 
then K% corresponds bijectively with the set of well-founded regular trees 
and hence with the set of well-founded trees. 


8.2. Generating the regular well-founded trees. How can we generate all 

regular well-founded trees? Certainly the following two clauses generate 

regular well-founded trees: 

1°) the single node tree {{ )} is regular and well-founded; 

2°) if To, T;, T>,.-. is an w-sequence of such trees, we can combine them 
into a new regular well-founded tree T, pictorially represented by 


(O)*Ty XI)*Ty) X2)*Tef -. \YkD*Ts 
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that is to say, 
T= BLT, = ((x)*nin€ 7.) U{( )}. 


If a regular well-founded tree is generated by repeated application of 1° and 
2° only, we shall call it an inductive tree; let us use ¥%7 for the class of 
inductive trees. 

1° and 2° above may now be reformulated as 
(i) (J FF, 
(ii) Vx(T, € 47) > 2T, € IT. 
ST is also the least class satisfying (i) and (ii), since we only put something 
in ¥7 by virtue of its construction from (i) and (ii); so, if & is any class of 
sets of finite sequences satisfying (i) and (ii), that is to say 


{( )} 2, Wx(X,€ 2%) ~ 2X, € &, 
then also 


ITC ®, 


8.3. Generating neighbourhood functions. #7 can equally well be described 
via the inductively generated elements of Kf; let us write K* for the set of 
these elements. K* satisfies two closure conditions parallel to (i), (ii): 

(i)* Ax.1 € K*, 

(ii)* Vx(y, © K*) > y € K*, 

where y satisfies y({ )) = 0, y((x)*n) = y,(n), or equivalently 


Ax.1 € K*, 
(x > = 0A Vx(An.y((x)*n) € K*) > y © K*. (1) 


K™* is to be the Jeast class of functions satisfying these closure conditions. 
So if we put 


A*(Q, a) = (a =Ax.1) V (af ) = 0A Wx(An.a((x)*n) € Q), 


then a moment’s reflection teaches us that Va(A*(Q, a) — Qa), being 
equivalent to Va((a=Ax.1l ~a€Q)A (ay )=0A Vx(An.a((x) *n) € Q) 
— a € Q)), expresses closure of Q under the closure conditions for K*, 
and thus the fact that K* is the least class satisfying these conditions is 
adequately expressed by 


Va( A*(Q, a) > Qa) > (K* CQ). 


8.4. Only slight changes are necessary to characterize the set K of induc- 
tively generated elements of Ko, i. those elements a € Ky for which 
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An.sg(an) © K*. One has only to replace A* by 


A(Q, a) = 
Ax(a@ =An.x +1) V (a0 =0A Wx(An.a((x)*n) € Q)); 
then 
Kio A( Kia) = Kw, 
K3 Va(A(Q,a) > Qa) > (KC Q). 


The axiom K1-2 may be split into 
Kl An.x+1EK, 
K2 af ) =O A Wx(An.a((x)*n) © K. 


So the only difference between K and K* is, that for K we permit 
arbitrary non-zero constant functions to start with, instead of only Ax.1. 

The proofs of the following two propositions show how one can use 
K1-3 to establish facts about K. 


8.5. PROPOSITION. K C Ko. 


PROOF, Informally, this is rather obvious: K was, after all, introduced as 
an inductively generated subset of Ky. We shall give a more formal proof 
based on K1-3 here. 

We apply K3 with a © Ky for Q(a). We have to verify the premiss 
of K3. An.x + 1 © Kg is obvious; and if for all x Am.a((x)*m) © Ko, 
and a0 = 0, then also a © Ky. For let 8 be any sequence, and let AB = 
Ax.B(x + 1). Then B = (B0)* AB, and a( B(x + 1)) = a((B0) *AB(x)); 
now Am.a((B0)*m) € Ko, hence for some x a((B0)*AB(x)) # 0. So 
a( B(x + 1)) # 0. It is also easy to see that an # 0 > a(n *m) = an; we 
only have to note that an #0 means n= (x)*n’ for some x,n’, so 
a((x)#n’) #0, hence also a((x)*n’') = a((x)*n'* m) = a(n*m). This 
establishes the premiss of K3 for this Q,so KC Ky. O 


N.B. The proof, as it stands, tacitly uses a closure condition on the range 
of the sequence variables: if B is a sequence, so is AB. This condition is not 
necessary: for a proof which avoids this assumption, see E4.8.1. 


8.6. Uniform bounds on bounded subtrees. Let a < B = Vx(ax < Bx). The 
set {a: a < B}, for any fixed 8, is a collection of functions ranging over a 
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finitely branching tree, namely the tree T, given by 
n © Tp = Wx < Ith(n)((n), < Bx). 


As we shall see, for each a € K, the length of nodes in {n:an=OAn€ 
T,} is uniformly bounded on 7,, and so is {an ~ 1: an #0 An & Tp}. 
We introduce the class K, by the following 


DEFINITION. K,a := VB4zVy < B(a(yz) # 0) A Vam(an #0 > 
an =a({n*m)). O 


ProposiTIon. K C K,. 
Proor. Apply K3 with K,a for Qa (exercise). O 


As we just established in 8.5, K C Ky holds; and classically one also 
easily establishes the converse: 


8.7. PROPOSITION (classical logic + AC-NN). Ky C K. 


Proof. Assume y € K)\ XK, and let yn :=Am.y(n*m), then ~s= 
y © Ky\ K, so y({_)) = 0 (otherwise y would be constant and greater than 
zero, and therefore belong to K). For some x9 ¥,,) © Ko\K, for if 
Vx(¥x) © K), then also y © K. Repeating this we find successively 
X,,X,... such that 


Vas = Ky\ K, Vie icy = Ky\ K, ete. 


So there is a sequence a, with ai = x; such that for all y yz, © Kg\ K. 
But since y € Ko, there is a y for which y;,(¢ )) > 0, i.e. yg, is non-zero 
constant and therefore belongs to K; we have thus obtained a contradic- 
tion. O 


REMARK. The application of choice above actually takes the form of DC-N, 
but this is equivalent to AC-NN. The proof uses PEM at each step, and is 
therefore of no use to us in obtaining an intuitionistic justification of 
Ky = K. 


8.8. Intuitive motivation for K , = K. Brouwer’s so-called “bar theorem” 
amounts to the assumption K% = K* or equivalently Ky = K. The effect of 
equating Kj and K* is to equate two notions of well-foundedness for 
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countably branching trees. Thus FAN, is generalized, which, as we saw, 
amounted to equating two notions of well-foundedness for finitely branch- 
ing trees. 

As in the case FANp, the identification of Ky and K is plausible only if 
the range of the function quantifier in VB(a(Bx) > 0) in the definition of 
a © K, is to be taken to contain all choice sequences; if we have too much 
information about an arbitrary sequence in the range of the variable B, 
Ky = K is no longer plausible (cf. the conflict between CT and FAN, in 
7.7). 

Since Ky = K sharpens the interpretation of quantifier combinations 
Vain, one would like to have metamathematical confirmation of the fact 
that we can safely use all the laws of intuitionistic predicate logic. As in the 
case of the fan theorem, such a confirmation can be obtained via the 
method of eliminating choice sequences (cf. 12.4.8). 

Let us now derive the most important corollary of Ky = K, namely the 
fan theorem. 


8.9. THEOREM. 
(i) Ky = K => FANp; 
(ti) Ky = K + C-N = FAN*. 


PROOF. (i) By 7.5 it suffices to establish FAN) in the form 
Vn(An V —An) A Wa < BAx A(ax) > 3z2Va < BIAx < 2 A(ax). 


(In fact, even the special case B = yx.1 suffices.) Let I, be given by 
T,(a@) = Ax.min(ax, Bx), then for a < B Iza = a, and Ipa < B for all a. 
Assume Wn(An V —An), Va < BaAxA(Ex), then also Wax (Tya(x)). Put 
A*(ax) = A(Ta(x)). By AC-NN! we find a y such that 


yn =1 04m <n(A*mA Vm < mA*n’). 


Obviously y € Kg since VadxA*(ax); by proposition 8.6 Ky = K C K,, so 
we find a z such ‘that Va < B(y(az) # 0), and thus also 4zVa < BAx <2z 
A(ax). 

(ii) Combining (i) with the fact that FAN, + C-N yields FAN*. © 


8.10. Bar continuity. C-N may be combined with K, = K into a single 
schema 


BC-N VaanA(a,n) > Jy © KVaA(a, y(a)), 
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or equivalently 
BC-N’ VadnA(a,n) > Jy € KWn(yn > 0 > Va En A(a, yn ~ 1)). 


The remainder of this section may be skipped at a first reading. The 
subsections 8.11—14 give some basic logical relationships between K, = K 
and several forms of bar induction which appear in the literature. 

8.15-17 give some supplementary comments on generalized inductive 
definitions, of which the introduction of K is a typical example. 

Finally, in 8.18 we give a concise discussion of Brouwer’s longer argu- 
ment for the bar theorem. 

The axiomatic basis in 8.11-14 is the system EL as described in 3.6.2. 


8.11. Bar induction. (Decidable) bar induction is the following schema 
BI, VadxP(ax) A Vn(Pn V +Pn) A Wn(Pn- Qn) A 
Vn(vyQ(n*(y)) > On) > Q ). 


BI, may be viewed as an induction principle over trees. From Wn 
(Pn V 4Pn) and Va3xP(ax) we see that {n: Vm < n—Pm} is a regular 
well-founded tree; the set {n: Pn A Wm < n—Pm} is a bar for this tree. 
From Wn(Pn > Qn) and Vn(VWyQ(n*(y)) > Qn) we see that Q is a 
property which holds on a bar and is transmitted upwards: if Q holds for 
all immediate successors of a node n, it holds for n. Now BI, permits us to 
conclude that then Q holds for the empty node. 
A more general version of BI, is monotone bar induction 


BIy  WadxP(ax) A Vnm(Pn > P(n*m)) A 
Vn(WyP(n*(y)) > Pn) > PC ). 


As we shall see below, on assumption of C-N BI) and BI,, are equivalent. 
It should be noted that the following apparently stronger version 


BIy,  WadxP(ax) A Vnm(Pn > P(n*m)) A Wn(Pn > Qn) A 
Va(WyQ(n*(y)) > Qn) > Q ) 


follows from BI: assume the premiss of BI\,, put P’n = VmQ(n * m) and 
apply BI, with respect to P’. 


8.12. ProposiTION. Let EL(K) be EL with the language extended with a 
predicate letter K. In ELK) + K1-3 we have the principle of induction over 


230 Non-classical axioms [Ch. 4 
unsecured sequences 
TUS Ka-> 

[vn(an #0 > Q’n) A Wn(VyQ'(n*(y)) > O'n) > OY DI. 
PRooF. Apply K3 to 

Qa = Vm[Vn(an #0 > O'(m*n)) A 
Vn(WyQ'(m*n*(y)) > O'(m*n)) > O'm]. 

ol oe Va © KQ(a), hence for any a € K VmQ'm, and so a fortiori 
eo». a 


8.13. PROPOSITION (logical relationships concerning BIp, BIy). Let EL(K ) 

be as in 8.12. 

(i) In EL(K) + K1-3 + AC-NN! we have K, = K @ BIp; in fact K is 
definable in EL + BI, and satisfies K1-3. 

(ii) In EL + C-N we have BIp = BI. 

(iit) In EL(K) we have BI, + C-N = BC-N. 


PROOF. (i) Assume EL(K) + K1-3 + AC-NN! + Ky = K, and let the hy- 
potheses of BI, be fulfilled. By AC-NN! there is a y such that 
yn =00Vm<n—Pm, Wn(yn <1); 
clearly y © Ko, hence y € K. For Q’n = Qn V Im < nPm one has 
Vn(yn #0 > Qn’), Wn(WyQ"(n*(y)) > Q'n) 


and applies IUS (8.12). 

Conversely assume EL + BI); we can now prove that K, satisfies the 
axioms K1-3 for K. It is obvious that Kg satisfies K1—-2 (this does not 
require BI,,). Let Q’ be any predicate such that 


Va(Ax.n+1€Q’), 
a0 = 0 A Wx(An.a((x)#n) € QO’) pa EQ’, 

then we have to show y € K, > y € Q’. So let y © Ko. Take 
Pn = yn + 0, 

i.e. Pn holds iff y, = Am.y(n * m) is constant, non-zero, and let 


On= 7, € 0. 
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Then Wn(Pn V 4Pn), VWn(Pn — Qn), Vadm(Pam), and obviously 
Vn(WyQ(n *(y)) > Qn); by BI, immediately Q( ), hence y € Q’. 

Clearly, since K3 implies K to be the /east predicate satisfying K1-2, we 
see that by the same reasoning in EL(K) + K1-3 + BI, we must have 
K = K,. N.B. In this direction we do not need AC-NN!. 

(ii) We leave BI,, = BIp as an exercise. For the converse, assume C-N, 
BI, and the hypotheses of BI, i.e. VadaxP(ax), Vam(Pn > P(n*m)), 
Vn(VyP(n*(y)) > Pn)). By C-N we can find a y € Ky such that Vn 
(yn # 0 > Va & nP(a(yn — 1))). Now put 


P’n:= yn #0 A Ith(n) > yn - 1, 
O'n := Pn. 


It is easy to see that P’, Q’ satisfy the hypotheses of BIp; thus Q’( ), ie. 
P< ) follows. 
(ili) Immediate via (i). O 


8.14. CoroLLaRy. In EL(K) we have BC-N = Ky = K. 
PRroor. Combine (iii) and (i) of the preceding proposition. O 


8.15. Generalized inductive definitions. In 7, K and K* we have encoun- 
tered examples of so-called generalized inductive definitions (g.i.d.’s), “gen- 
eralized”, since the type of closure conditions used to generate them 
generalizes the “finitary” closure conditions considered in section 3.6. 
Namely, from a countably infinite collection of elements of the inductively 
generated set one constructs a new element of the set; e.g. for ¥7 we had 
the condition: if for allx EN T, € #7, then 2,7, € FT. 

An example of a g.i.d. which is familiar from recursion theory is provided 
by the Kleene recursive ordinals © with partial ordering <,. <, is 
generated as the least relation satisfying 


0 <p, j(1,0) (/(1,0) # 0 is assumed), 
X <o y> x <o JQ, ¥), 
x <o YY <o i(l, y), 


a 


x <o{y}(z) A V2({y}(z) <o{y}(z + 1)) > ¥ <o JQ, y), 


and 0= {x:x =0V0<, x}. 
In this example © and <, are introduced as new predicates relative to 
the language of arithmetic. Once <, has been accepted as well-defined, we 
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may iterate the procedure: we can introduce <,, as the least relation 
satisfying the closure conditions for <, with <, replaced by <j, , and 
in addition 


xEONx <o{y}(z) A VWz,2’EO(z <Q 2' > 
{y}(2) <0 {9 }(2)) > x <o JG, y). 


This is an example of an iterated g.i.d.; that is to say, iterated relative to the 
language of arithmetic (it is a simple g.i.d. relative to the language of 
arithmetic extended with <,). 

Generalized inductive definitions play an important role in definability 
theory, recursion theory and proof theory (cf. Aczel 1977, Buchholz et al. 
1981). It is beyond the scope of this book to enter any further into the 
subject, except for one interesting observation. In section 3.6 we replaced 
(finitary) inductive definitions by explicit definitions in arithmetic via the 
notion of a finite (labelled) tree. The class of finite labelled trees is itself 
inductively definable, but in arithmetic we can straightforwardly replace 
induction over finite trees by ordinary induction over the depth of such 
trees. Thus the idea of the explicit definition consisted in a reduction of 
inductive definitions defined via A €# (3.6.5) to a certain standard 
inductive definition, namely the class of finite labelled trees. 

In the same manner we can reduce a wide class of g.i.d.’s to K or K*, 
which represent countably infinitely branching trees; that is to say, many 
g.i.d.’s can be defined explicitly in terms of K or K*. We state the precise 
result below, without proof. 


8.16. DEFINITION. Let P= Y(EL), and let #(X) be Y with an ad- 
ditional predicate letter X for a set of functions. The class #! of strongly 
positive formulas A(X,a) in £(X) consists of those A(X, a) which can be 
constructed from formulas of & and formulas Xp (@ a function-term of 
YL) by means of A, V,Vx,4x,Ja. O 


Note that strong positivity guarantees monotonicity 
A(P,a) AP CP’ >A(P’,a@), 
which can be proved purely logically. 


8.17. LEMMA. Assuming AC), and arithmetic any strongly positive A(P, a) 
is equivalent to an A°(P, «) of the form 


IBVx|R(a, B, x) V (Q(a, B, x) A Pela, B x])I, 
where R,Q, are in £, a function-term of #. O 
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THEOREM. Let IDB, be the theory obtained by adding to EL( K) the axioms 
K1-3 and ACo,, and let A(P, a) be strongly positive in P. Then we can 
explicitly define in IDB, a predicate P, such that 


Va[A(P,,a) > P,al, 
Val A(Q, a) > Qa] > Va[ P,a > Qa]. 


Proor. Along the same lines as the proof of 3.6.11, using the lemma. O 


8.18. Brouwer’s longer argument for the bar theorem (digression). Let us call 
a set P of finite sequences a bar if Vaax(ax € P). More generally, we shall 
say that n is barred by P (or nis P-barred ) iff Va © nax(ax © P). Where it 
is understood which set P is meant, we simply write “n is barred”. 

The basic assumption underlying Brouwer’s proof (1927), reiterated by 
him for example in (1954) is that from any proof of “({ ) is barred” one can 
always extract an “infinitary proof-tree” which is based on three types of 
inference only: 

(I) né&P, hence n is barred (immediate inference); 

(D) for all x, n*(x) is barred, hence n is barred (downwards inference), 
(U) n is barred, hence n * (x) is barred (upwards inference). 
Traditionally these inferences are called y-, f- and p-inferences respectively 
(f is the symbol for the obsolete greek letter wau). 

Under suitable assumptions on P, the U-inferences are eliminable. This 
is the case, for example, if P is monotone (ie. n © PAn<m—>meéP) 
or decidable (cf. E4.8.10). The following example essentially due to Kleene 
shows that the U-reférences are not eliminable for arbitrary P (Kleene and 
Vesley 1965, section 7.14; see also Dummett 1977, pp. 98-99). Take 


P= {( ):3x(Bx =0)} U ((x) : Bx # 0}; 


P is obviously a bar, since for any a we only have to check whether 
B(a0) # 0 or not. If B(a0) # 0, then (a0) = a1 € P, and if B(a0) = 0, 
then a0 = ¢ ) © P. The following “infinitary proof tree” may be extracted 
from this argument: 


Bn=0 
I 
BO #0 £1 #0 B2+#0 « ) is barred 
I | I I U 
(0) is barred (1) is barred (2)isbarred --- (n) is barred --- 


ee 


« ) 1s barred 
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Note that P is decidable iff we can decide 4x( Bx = 0). If we know that 
3x( Bx = 0), we can replace the infinitary proof tree by 


Bn=0 
I 
« ) is barred, 


and if we know that 44x( 8x = 0), the infinitary proof-tree uses only I- and 
D-inferences. But as long as 4x(8x = 0) is unknown, we cannot indicate 
which tree without U-inferences is appropriate; in particular, we cannot 
indicate whether “(0) is barred” appears in the tree or not. 

The same example can be used to show that the following (classically 
unproblematic) form of bar induction 


VadxP(ax) A Wn( Pn > Qn) A Wn(WyQ(n*(y)) > Qn) > OX ) 


is not intuitionistically acceptable (cf. E4.8.11). One also readily sees that 
Brouwer’s assumption plus the eliminability of U-inferences in, say, the 
monotone case leads to BI y. 


9. Uniformity principles and Kripke’s schema 


This section will be devoted to two principles which are of considerable 
interest because of the underlying intuitive motivation, but which will not 
play an important role in the mathematical developments in this book. The 
reader may therefore decide to skip this section until needed; the principles 
under discussion, the uniformity principle and Kripke’s schema will turn up 
afterwards only in asides, digressions and exercises. We turn first to the 
uniformity principle. 


9.1. The uniformity principle for numbers. This may be stated as a schema 
UP WX4n A(X, n) > AnVX A(X, n) 


(X ranging over sets of natural numbers). The idea behind this principle is 
as follows. Constructively, the sets of natural numbers are a very “diffuse” 
totality, at least from an extensional point of view: given any two sets X, Y 
one can easily construct a third one which cannot be shown to be distinct 
from either of them: 


xE€Z=e{xEXAF)V(xEYA-F)}, 
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where F is some (as yet) undecided statement. Thus, if one has assigned to 
each X C N a natural number, i.e. an element of a “very discrete” set, the 
only imaginable form of such an assignment seems to be a constant 
function. In fact, the following weaker form of the uniformity principle 


UP! = WX4!n A(X, n) > InVX A(X,n) 


(equality between subsets of N being interpreted extensionally) is easily 
proved on assumption of 


VP(AP V =P), 


where P ranges over propositions. 
To see this, assume A(X, n), A(Y, m) and define Z, by 


x€Zp={x:(x EXAAP)V(xXEYAP)}, 


then if n # m, and A(Zp, k), we find Y# X andk#m—>-+P,k=m- 
—iP, and thus VP({P V ——4P). Therefore n = m and so VY A(Y, n). 


9.2. The uniformity principle for functions. This may be stated as a schema 
UP,  WXda A(X,a) > JaVX A(X, a). 


The motivation is similar to the one given for UP: even if functions (in 
particular: choice sequences) need not be completely determined, their 
initial segments can be determined; that is to say, N™ is a separable totality 
which is far less “diffuse” than the collection of subsets of the natural 
numbers. Note also that quite trivially 


CT A UP = UP, 


since by CT the function quantifier in UP, reduces to a number quantifier. 
On the other hand, UP,!, with VX4!a instead of VX4a in the premiss does 
not give us anything new: UP! > UP,!, because of VX dla A(X, a) @ 
VxV XA! ydal A(X, a) A ax = y]; assuming AC-NN for decidable predi- 
cates, this gives us UP,! from UP!. 


9.3. Parameterization principles and Kripke’s schema. We shall say that the 
subsets of N are parameterized by functions, via a (fixed) predicate A, if the 
following schema holds 


PF, WXdaVn[n€ Xe A(a,n)]. 
Classically, PF, holds for a particularly simple A: take A(a, n) = (an = 0). 
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If a is a parameter for X, i.e. Vn[n © X © an = 0], @ is a characteristic 
function for X. 

Certain “creative subject” arguments by Brouwer, to be described below, 
lead to the assumption of “Kripke’s schema” 


KS  VWXaaVn[n € X @ Im(a(n, m) = 9)], 


so KS is PF, with A(a, 2) = dm(a(n, m) = 0). 

Brouwer’s “creative subject” arguments may be viewed as an extreme 
expression of Brouwer’s solipsistic view of mathematics. In the theory of the 
creative subject (= idealized mathematician; IM for short) the creative 
subject is viewed as carrying out his activity in w discrete stages. Let us 
write “O,, A” for: “the IM has evidence for A at stage n of his mathemati- 
cal activity” (elsewhere in the literature one finds +, for 0, ). 

It is assumed that at any stage the IM knows whether he has (enough) 
evidence for A or not, so 


IM1 (GO, A) v —(0,,4). 
Secondly, evidence is supposed to be cumulative: 
IM2 (O,,4)An>m- (dO, A). 


Finally, the solipsistic principle is assumed: true is, for the IM, only what 
he discovers to be true in the course of his own mathematical activity: 


IM3. A © 4n(0,, A). 


Using IM3 as a mathematical insight by the IM is an instance of self-reflec- 
tion. As an example of the use of IM3 in the construction of mathematical 
objects, consider a constructed as follows, for any given X CN: 


O,,(1 € X) > a(n, m) = 0, 
(0,,(n € X)) > a(n, m) = 1. 


Clearly, n € X @ Im(Q,,,(n € X)) @ Am(a(n, m) = 0), and this justifies 
KS. Classically of course, KS is immediately obvious-it is a weaker 
statement than the existence of a characteristic function for each X CN. 
The theory of the creative subject gives rise to considerable conceptual 
difficulties, and has generated a fair amount of discussion in the literature, 
but a critical appraisal is beyond the scope of this chapter; we shall have 
something more to say about the “creative subject” in section 16.3. 
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The following proposition relates KS, continuity, CT and the uniformity 


principles: 


9.4. PROPOSITION. 
(i) KS + WC-N => UP. 
(ii) KS > —UP,, CT, =CONT,,. 


PROOF, (i) Let VXInA(X, n); take for X = {m:4n(a(m, n) = 0)}, then 
vaak A*(a, k), 

where A*(a, k) = A({n:dma(n, m) = 0}, k). By WC-N 
Vaaknvp € an A*(B, k). 

Take a = )x.1 and let k, n be such that 
VB & (Xx.1)n A*(B, k). (1) 


Now if m € X @ dn(a(m, n) = 0), then for n’ = (Xx.1)n, there is an a’ 
such that n’* a’ is also a parameter for X, hence from (1) VX ACX, k). 
(ii) apply UP, to KS: 


JaVX Wn(n € X & Am(a(n, m) = 0)), 


which is obviously false. We leave the proof of the other two statements to 
the reader. O 


9.5. Kripke’s schema and Markov’s principle. Brouwer (1948) introduced 
the explicit reference to the activity of the ideal mathematician in a 
mathematical construction in order to obtain a weak counterexample to 
x #0 -— x #0 for a real number x. As we have seen (5.4), Vx € R(x #0 
— x # 0) is equivalent to Markov’s Principle, and in an axiomatic setting 
Brouwer’s example leads to a refutation of MP from KS: 


PROPOSITION. KS + MP = PEM. 

Proor. Let 8 be any function and apply KS to A = B V —B; by KS we 
find an a such that Sy(ay = 0) © A, and since =A holds we have ——4y 
(ay = 0), and thus by Markov’s principle Jy(ay = 0), ie. BV AB. O 


Thus already the weakest “non-classical” assumption {VP(P V AP) 
suffices to obtain a contradiction between KS and MP. 
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Kripke models for second order arithmetic, as introduced in 3.8.8 satisfy 
the uniformity principle at least partially, as shown by the following 


9.6. PROPOSITION. Let X be a Kripke model for HAS as described in 3.8.8, 

with underlying tree (K, <). Then, 

(i) If (K, s) is the full binary tree or the universal tree, then UP! holds 
in X; 

(ii) Jf (K, <) is the universal tree, then X satisfies closed instances of UP, 
i.e. those instances of UP where A does not contain free set variables 
besides X. 


PRooF. We leave (i) as an exercise; the idea is to give a Kripke model 
version of the argument in 9.1. ; 

(ii) Assume k t+ VX3AnA(X, n), k  AnVXACX, n), A not containing 
free set variables besides X. By the second assumption there are fam- 
ilies “ € D, for each m EN, such that k ¥ A(L%,n). Suppose F = 
{S,,:k © K}, and define Y= {S,: k © K} by 


S,:= 2 if A(k’>k), 
Skenyek = Sn kek 


By the first assumption k I+ A(Y,m) for some m, and therefore also 
k*(m) It ACF, m). 

Now the restriction #’ of # to nodes k’ => k*(m) is isomorphic 
(modulo duplication of some elements in D,) to the model restricted to 
nodes k’ = k; and in #’ ¥ corresponds to ¥&,. Since A does not contain 
set variables besides X, it follows that since k*(m) I+ A(Y, m), also 
k i+ A(X, m), contradicting our second assumption. Therefore k tt 
anVXA(X,n). O 


10. Notes 


10.1. Axioms of choice. Probably the first place where an axiom of choice 
explicitly appears in formalizations of intuitionistic mathematics is Kleene 
(1957) (ACg9); Kleene and Vesley (1965) introduces AC,). Kreisel’s for- 
malization of the theory of choice sequences (1963, and sections 2.5—6 of 
1965) also contains choice axioms. The presentation axiom appears in the 
context of constructive set theory in Aczel (1978). 

Further information about logical relationships between versions of the 
axiom of choice and dependent choice in the context of formal constructive 
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analysis can be found in Howard and Kreisel (1966) and in Kreisel and 
Troelstra (1970). 


10.2. Church’s thesis. The original “thesis” was not formulated in the 
context of constructive mathematics, so strictly speaking, the designation 
“Church’s thesis” for the principles discussed in section 3 is incorrect, but 
since the terminology is generally accepted in discussions of constructivism 
we adopt it here too. A thorough discussion of Church’s thesis in the 
context of constructivism may be found in Kreisel (1970, 1972). Church’s 
thesis, in an appropriate formulation, has been shown to be consistent with 
most intuitionistic formalisms studied in the literature. “Appropriate” 
means here that e.g. in CT the @ ranges over lawlike sequences only, not 
over choice sequences, and that in CT, the predicate A should not contain 
choice parameters. See also Webb (1980), Shapiro (1981). 


10.3. Realizability. Realizability by numbers was formulated by Kleene 
(1945); his original version was the non-formalized one. The formalized 
treatment of realizability for intuitionistic arithmetic was carried through 
by Nelson (1947). 

In his retrospective survey Kleene (1973) states that his inspiration for 
the notion was not the BHK-interpretation, but rather certain formulations 
in Hilbert and Bernays (1934, p. 32). 

In retrospect it seems clear enough that the ideas taken from 
Hilbert-Bernays and the BHK-interpretation are actually very close. More 
information on realizability can be found in Troelstra (1973 sections 3.2, 
3.3) and Beeson (1985). 

In Kleene (1960) the self-realizing character of almost negative formulas 
was demonstrated; in Troelstra (1971) this fact formed the basis of the 
axiomatic characterization of the realizable formulas by means of ECTp. It 
is interesting to note that HA + CT,! # CTy (Lifschitz 1979) and HA + 
CT, # ECT, (Beeson, cf. Troelstra 1973, 3.4.14). 

Dragalin (1969) uses ECT, (5.5) to characterize true realizable formulas 
with the help of transfinite progressions. 

Kleene (1952, section 82) also considered a variant of realizability (+ - 
realizability) where in the clauses for V,4, ~ conditions involving formal 
deducibility were inserted, e.g. 


nv(AV B) iff 
(jn = Oand j,nrA and + A), or (j,n =1and j,nrBand + B) 
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etcetera; from this notion Kleene afterwards (1962) extracted the Kleene 
slash (3.5.3). 

For his realizability by functions (Kleene and Vesley 1965) Kleene 
introduced in (1969) the formalized version, together with the q-variant 
which is similar to + -realizability, except that formal derivability has been 
replaced by truth. Troelstra (1971) studies the corresponding, but simpler, 
q-realizability by numbers. q-realizability as defined in E4.4.7 is slightly 
different; the relationships can be expressed by “Kleene’s version of q-real- 
izability” : “q-realizability of E4.4.7” = Kleene slash: Aczel slash. It was 
Grayson who observed that the realizability of E4.4.7 did the same job as 
Kleene’s variant. The notion of E4.4.7 has the advantage of being closed 
under deducibility. 

More about realizability may be found in sections 9.5 and 13.8. 


10.4. Markov’s principle. First announced in Markov (1954), according to a 
quotation by Shanin (1958A) (cf. Kleene and Vesley 1965, section 10). 
Kushner (1973) gives an even earlier published source which could not be 
checked by us. 

Myhill (1963) gives an argument based on some general principles for an 
abstract theory of constructions refuting Markov’s principle. The argument 
bears some similarity to the refutation of Markov’s principle under the 
assumption of Kripke’s schema in 9.5. 

Logical relationships involving Markov’s principle are discussed in 
Luckhardt (1977, 1977A). Our PT in 5.3 corresponds to principle (a) in 
Luckhardt (1977). 

Information on the Dialectica interpretation may be found in Troelstra 
(1973, section 3.5), and in the “Introductory Note to Gédel 1958, 1972” in 
Gédel (1987). 


10.5. Choice sequences. Information on the history of choice sequences can 
be found in Troelstra (1977, 1982A, 1983); of these, (1977) and (1983) also 
contain a fairly complete bibliography on the subject. The lawless se- 
quences are discussed in much greater detail in chapter 12. 

The first formulation of continuity, corresponding to our WC-N, is found 
in Heyting (1930B). Kleene (1957) contains a combination of the fan 
theorem with continuity. In Kreisel (1963) (later summarized in Kreisel 
1965, sections 2.5-6) continuity appears in the form of bar-continuity 
(8.10). Kleene and Vesley (1965) is based on CONT, (Brouwer’s principle 
for functions) which specializes to C-N (Brouwer’s principle for numbers). 
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More information on logical relationships between various forms of 
continuity and choice axioms is contained in Kleene and Vesley (1965) and 
Kreisel and Troelstra (1970). 

Brouwer used continuity for choice sequences in lectures as early as 
1916-1917; the first clear-cut statements of continuity, fan theorem and bar 
theorem appear in Brouwer (1924). A very clear formulation of the bar 
induction principle is contained in footnote 7 of Brouwer (1927). 


10.6. Uniformity, parameterization and Kripke’s schema. The uniformity 
principle UP is explicitly formulated in Troelstra (1973A); before that date 
Kreisel (1971) drew attention to the fact that DP and ED for IQC imply 
remarkable “uniformity rules” for intuitionistic validity. More information 
on UP and UP! may be found scattered in the papers de Jongh and 
Smorynski (1976), van Dalen (1977, 1978) and Troelstra (1977A). 

As far as we know, the name “Kripke’s schema” first appears in Myhill 
(1967), and in Myhill’s contribution to the discussion in Kreisel (1967), for 
the following weak version 


WKS — da[(Ax(ax = 0) > A) A (4ax(ax = 0) > = A)]. 


WKS — —CT is due to Kripke (Kreisel 1970, Note 10; cf. also Myhill 1967, 
final paragraph). Myhill also mentions (1967, p. 295) the strong version but 
notes that it is not needed for the reproduction of Brouwer’s counterexam- 
ples. 

Kreisel (1967) contains the earliest axiomatization of the theory of the 
creative subject. In Kreisel’s axiomatization the possibility that there is 
more than one creative subject is left open: there is a parameter S ranging 
over creative subjects; the crucial principle corresponding to our IM3 there 
takes the form 


(A > VE—4m(Og ,, A)) A VE(Am(Os,,, A) > A) 


(the intention, not explicitly stated in the paper, is that A @ J2m(Qy ,, A)). 
This corresponds to WKS. Troelstra (1969) gives a survey of the theory of 
the creative subject. 

WKS is in general weaker than KS (Krol 1978A), and BI + C-N + KS 
is consistent relative to elementary analysis (Krol’ 1978), a fact which is far 
from evident from the intuitive justification of BI and C-N for choice 
sequences. 


10.7. The paper Troelstra (1977A) discusses some further “non-classical” 
principles such as subcountability of discrete sets (SCDS); little has been 
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published about these so far. McCarty (1986) shows that SCDS is con- 
sistent with a set theory based on intuitionistic logic (IZF of chapter 11). 


Exercises 


4.1.1. Prove lemma 1.4. 


4.2.1. If we assume enough set-comprehension as admissible, we can construct for each 
proposition A the sets a= {x: x =OV(x=1AA)} and b= {xi(x=OAA)VxX=1)}; 
clearly Vy € {a, b}4x(x € y). Derive PEM by application of AC to this premiss (Diaconescu 
1975, Goodman and Myhill 1978). 


4.2.2. Show that DC-N is equivalent to AC-NN, and show that AC-ND (i.e. AC-N for a 
specific domain D) follows from DC-D. Show also that any instance of RDC-D be derived 
from DC-D’ for suitable D’, where D’ may depend on the instance of RDC-D to be derived 
(Kreisel and Troelstra 1970, 2.7.2). 


4.3.1. Show that, assuming CTy, the following principles of classical predicate logic are 
constructively false by refuting universally closed instances of them: 


a-(P v Q) -—=4Pv HQ, 


a-adxd > 4x4, 
33xA —*, VxA, 
-Wxid-> dxA, 
Vx(A V B(x)) > AV VxB(x) (x € FV(A)). 
4.4.1. Complete the proofs of 4.5 and 4.6, and fill in the details of the remark at the end of 4.6. 


4.4.2. Complete the proof of 4,10. 


4.4.3. Establish the refinement mentioned in the remark at the end of 4.10: for each sentence 
A provable in HA there is a numeral 7 such that HA + 791A. Hint. Make use of the fact that 
only true L?-sentences can be proved in HA (see e.g. Troelstra 1973, 3.2.4). 


4.4.4. Show that the addition of the following schema 
ECTy!* 9 Waf An > 3tmB(n, m)] > 3kWn[ An > Im(Tknm A B(n, Um))} 


to HA is inconsistent. N.B. A is not assumed to be almost negative. This shows that, a fortiori, 
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the generalization ECT} of ECT) with condition “A almost negative” dropped is inconsistent. 
Hint. For the premiss of ECT) !* take Wn[(AmTnnm V —3mTnnm) > 3k((k = 0 A ImTnnm) 
V (k =1 A —=dmTnnm))] (Troelstra 1973, 3.2.20, remark). 


4.4.5. Let Ig be the class of formulas of HA such that for all subformulas A > B of aC € Ig 
A is almost negative. Show that HA + ECThy is conservative over HA with respect to formulas 
of Ig (Troelstra 1973, 3.6.6). 


4.4.6. Extend the result of exercise 4.4.3 to HA + ECTp (Troelstra 1973, 3.2.15). 


4.4.7, Define the notion of q-realizability as follows: the clauses (i)-(iii), (v)-(vi) in 4.2 are 
also adopted for q-realizability, with q everywhere replacing r; (iv) is replaced by 


xq(A > B) =Vy(yqA > Au(Txyu A UuqB)) A (A> B). 


Show the following 

(i) HAF 3x(xq A) A, 

Gi) HA + ECT) + A © Ax(xq A), 
(iii) HA + A = HAF 4x(xq A). 
(Cf. remarks in 10.3.) 


4.4.8. Use the result of exercise 4.4.7 to show closure of HA under Church’s rule 

CRg  HAt Vx3yA(x, y) = HAF 3zWxd(x, {z}(x)), 

(Troelstra 1973, 3.7.2). Use this result in turn to derive MR from MRpp (3.5.4). 

4.4.9. Show that proposition 4.5 also holds for q-realizability (E4.4.7) and use this to 
strengthen the result of E4.4.8 to closure of HA under 

ECR, F Wx[Ax > 3yBxy] =F 3zVx| Ax — E{z}(x) A B(x, {z}(x))]. 


(extended Church’s rule, Troelstra 1973, 3.7.2). 


4.4.10. Show that the refinement of E4.4.3 also holds for q-realizability; use this to give 
another proof of DP and EDN for HA (cf. 3.5.6-10). 
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4.4.11, Extend r-realizability to HAS (cf. 3.8.4) as follows. To each set variable X we assign a 
set variable X* such that X # Y = X* # Y*, and we add the following clauses 


xr(te X) = (x,t) © X*, 
xtWXA = WX*(xtA), 
xVAXA = 4X*(xrA). 


Prove the soundness theorem, thereby establishing consistency of HAS + CTy. N.B. Note that 
in fact one may choose X* = X. (Troelstra 1973, 3.2.31.) 


4.4.12. Extend q-realizability to HAS (3.8.4) as follows. We now use each set variable X to 
code two sets: Xp = {x:2x © X}, X= {x:2x+1 6X}. Now xqA is defined as xr A 
for HAS except for the following clauses 

xq(tE XxX) = (x,t) EX, 

xqdA> B=Vy(yqd > {x}(y)qQB) A (A > B)*. 
Here C* is obtained from C by replacing all occurrences : € X for numerical terms ¢ and set 
variables X in C by t € X, (or 21+ 1 © X). Prove the soundness theorem for sentences A: 
HAS F A = HAS + 71qA for some numeral 7%. Use this to obtain Church’s rule, EDN and 
DP for HAS (cf. Friedman 1977, and E4.4.7-10). 
4.4.13. Let PCT be “Partial Church’s Thesis” 

VadxVyz(Txyz @ my = Uz), 


where @ ranges over partial functions from N to N. Show that PCT conflicts with the 
following axiom of partial choice 


Vx © AFyB(x, y) ~ InVx € A(E(2x) A B(x, tx)) 
(compare E4.4.4). 


4.5.1. Show that 

(i) MPpp and MP are equivalent in the presence of CT); 

Gi) MP and PT are equivalent. 

Formulate a generalization PT’ of PT also permitting possibly empty sets and show 
(iii) MP and PT” are equivalent. 

((i) and (ii) in Luckhardt (1977). 


4.5.2, (Assuming familiarity with elementary recursion theory.) Show that MPpr is equivalent 
to 


Vxy [4A32Txyz > 32Txyz], 
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and even to 


Wx [-Ad2Txxz > 42Txxz). 


4.5.3. Show that ECT, and ECTy (5.5) are equivalent in the presence of MPpp. 


4.5.4, Show that, relative to HA, assuming the realizability of MP is equivalent to assuming 
MP itself. 


45.5. Let IP be the schema (4A > 4yB) ~ 4y(A4 > B) (y not free in A) and show 
HA + MPpp + IP + CTyp 1 (Troelstra 1973, 3.2.27(i)). 


4.5.6. Modify realizability for HAS (E4.4.11) by putting as clause for te X 
xr(te X) =—A3(x,1t) © X*. 
Check the soundness theorem for this variant and show moreover that “Shanin’s principle” 


SHP  vxXaYVx[x © Xe ay((y.x) € Y)] 


is realizable (Gordeev 1982). 


4.6.1. Show that WC-N actually is equivalent to the following principle: if {X,: 1 © N} is a 
covering of N™ by arbitrary sets, then {Int(X,,): 1 € N} also covers NN. 


4.6.2. Give an explicit definition of the neighbourhood functions for the following functionals 
inN™ >N: 


®,a = a(a0), 
®,0 = a0 + al + --- +a(ad), 
G0 = al + a2, 


,a = a(a(a0)). 


4.6.3. Show that AC-NN follows from NFP (6.12). Hint. Given A(x, y), consider A*(ax) = 
x >0A A(ad, x = 1). 


4.6.4. Derive C-N from WC-N + NFP. Hint. Use the preceding exercise, and apply NFP to 
Vad yIxVB € ayA(B, y) obtained from WC-N (6.12). 


4.6.5. Prove Lindeléf’s theorem (6.14) for NN from NFP. Hint. Let {W,:i © I} be an open 
cover of NN; apply NFP to Vaaxdi € I(V;, ¢ W;). 
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4.6.6. Refute the schemata of E4.3.1 assuming WC-N instead of CTh. 

4.6.7. Show that C-C implies C-N. 

4.6.8. Define neighbourhood functions (cf. 6.15) for the following functionals in NY > NN: 
Via n*a, 
Wya = ja + joa, 
Wa = Ax.&x, 


Wya = Xx.a(ax). 


4.6.9. Suppose that ¥, € NN > NN is given by y € Ky, and ® ¢ NN >N by 6 & Ky. 
Define explicitly from y and 8 a neighbourhood function for the composition @;« ¥%, € NN 
— N (Kreisel and Troelstra 1970, 3.2.11-12). 


4.6.10. Let ¥,, ®s be as in the preceding exercise. Show that there is no total A € NN x NN 
— N™ such that for neighbourhood functions y, 6 always %,°%, = Pq5,,, (Kreisel and 
Troelstra 1970, 3.5). 


4.7.1. Show in detail that, assuming classical logic, FANp implies Kénig’s lemma. 


4.7.2. For the functionals © mentioned in exercise 4.6.2 determine (a) their maximum value on 
To, and (b) their maximum value on the tree of sequences with values bounded by n. 

Determine also a least upper bound for the length of initial segments of infinite branches 
needed to determine the values required under (a) and (b). 


4.7.3. Show that FAN and NFP of 6.12 are classically valid. 
4.7.4, Give an example showing that the principle 
Va € Ty3xA(a, x) 2 IzVa € Ty4x < zA(a, x) 
is not compatible with classical logic. 
4.7.5. Prove 7.4ii). 


4.7.6. Refute 3-PEM by means of FAN*(7%,) without assuming the existence of lawlike 
elements among the 01-choice sequences, but only the existence of a Ol-sequence a € n for 
each n & Ty. 
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4.1.7. Show that CT implies that 2“ and N% are homeomorphic. Hint. Let Tp = {n:Vm <n 
Rm} be the Kleene tree as defined in 7.6. Let f be a function enumerating without repetition 
the terminal nodes of Tp. Then ¥ € N™ — 2% given by 


¥(a) = f(a0)* f(al)* f(a2)* f(a3)* + 
is a homeomorphism (Beeson 1985, IV, 13). 


4.8.1. Prove Va € KVBVnim(B En > (am >0AB En*m)) without assuming any 
closure conditions on the range of function variables (Kreisel 1968, p. 232). 


4.8.2. Prove K C K, (8.6). 
4.8.3. Complete the proof of 8.12. 


4.8.4, Prove 8.13(ii), namely EL + BI, + BIp in EL (Howard and Kreisel 1966, Theorem 
4A). 


48.5. Assuming K1-3, show Vy € KV6 € Ky[Vn(yn # 0 > 6n #0) > 8 € KJ. 


4.8.6. Show by induction over K that y © K > An.yn- sg(Ith(n) ~ yn) © K (use the preced- 
ing exercise). From this it follows that 


y@K—> Fy’ € K(y'n#0> yn =y'n A y'n < Ith(n)). 
4.8.7. Give a direct proof that BC-N => Ky = K, using the preceding two exercises (cf. 8.14). 
4.8.8. Prove the lemma and theorem in 8.17 (Kreise] and Troelstra 1970, section 4). 


4.8.9. Let « © K; define on the corresponding tree T, = {n*(m): an =0,m EN} k <*k’ 
:= k > k’ and show that transfinite induction holds w.r.t. <*, ie. Vy(Wx <* pF(x) > F(y)) 
— VxF(x) for x, y ranging over T,. 


48.10. Define P, and P, by a generalized inductive definition as follows. Put 
A(n, X) =n & PV Vy(n*(y) EX) V Azm(n = m*(z) AmEX), 


A,(n, X) =ne€ PV Vy(n*(y) © X); 
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then P,, P, are characterized by 


ae P,) md Pin, 
vn(Ai(1,Q) >n€©Q) > (Pc Q). 


The assumption in Brouwer (1927) underlying the proof of the “bar theorem” amounts to 
« ) is P-barred >= ( ) € P, 

and the eliminability of U-inferences corresponds to 
()ER = (EP. 

Show that for monotone P we even have P, = P,, and for decidable P 
Vn(n€@ Pi > ne P,V Iim~<n(me P)) 

(Martino and Giaretta 1981). 


4.8.11. Show that VaaxP(ax) A V(Pn > Qn) A Vn(WyQ(n*(y)) — Qn) + QC ) implies 
Jn(Bn = 0) V ~3n(Bn = 0), for P as in Kleene’s example (8.18) and suitable Q. 


4.8.12. Prove, assuming bar-continuity BC-N (8.10) plus a density principle Wnda(a € n) that 
~3J-PEM can be proved without assuming that lawlike functions are (extensionally) equal to 
choice sequences. 


4.8.13. Derive double bar induction from BI, where double bar induction is the following 
principle: 
VaBAxyP(ax, By) A 
Van'm(P(n,m) > P(n*n’,m) A Wnmm'(P(n,m) > P(n,m*m’)) A 
Wn(WxP(n*(x),m) > P(n,m)) AVm(VyP(n,m*(y)) + P(n,m)) 
> P(().¢)). 


4.9.1. Show that KS + CT, KS F {CONT, (9.4(ii), Myhill 1967). 


4.9.2. Prove 9.6(i); give a counterexample to 9.6(ii) for a non-closed instance of UP (de Jongh 
and Smorynski 1976, p. 163). 


4.9.3. Use r-realizability (E4.4.11) for HAS to establish consistency of HAS + UP + CTo, and 
q-realizability (E.4.4.12) to derive EDN and DP (3.5.6) for HAS + UP + CT. 
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4.9.4. Show that C-N + PF, imply UP, and that PF, contradicts UP,. This generalizes 9.4 
(Troelstra 1977A, section 5). 


4.9.5. Let H be any extension of EL + KS + AC-NF in the language of EL, and HS the 
corresponding extension with variables for sets of natural numbers, impredicative comprehen- 
sion and extensionality for sets added. Use KS to define a translation + such that HS + A = 
H+ 1(A), and HAF A = EL + KS + AC-NFE 1(A) (van Dalen 1977). 


4.9.6. Introduce functions in the models of 9.6 via their graphs, ie. Y is a function at node k 
if k & Wn4!m((n, m) € #). Show that on this interpretation of functions KS is false in the 
models (de Jongh and Smorynski 1976). 


CHAPTER 5 


REAL NUMBERS 


In this chapter we give a detailed constructive treatment of the theory of 
real numbers. The reader who wishes to arrive speedily at some examples of 
constructive mathematics may restrict himself, at a first reading, to the 
approach via fundamental sequences, which is developed in sections 2-4, 
and then continue with chapter 6. 

In sections 5 and 6 we describe the introduction of reals by means of 
Dedekind cuts; simultaneously with the Dedekind reals (R“) we treat the 
much larger structure of the (bounded) extended reals R°*, which turns out 
to be the uniquely determined order completion of Q. In R®* each bounded 
inhabited set has a supremum. A still larger structure is R°, which is in 
many respects similar to R in classical mathematics. 

We have chosen to introduce both the Cauchy reals and the Dedekind 
reals, even though on the basis of ACgg, a choice principle accepted by 
most constructivists, both structures can be shown to be isomorphic. One 
reason for this is that in chapter 15 we shall encounter some “natural” 
models for analysis in which ACoo fails, and where R and R@ are actually 
not isomorphic. In these models R@ is the more interesting structure of the 
two. Another reason is that the treatment of R®* and R¢ is most naturally 
given in parallel with a treatment of R¢. 

R4,R°°,R° have unique characterizations as ordered structures extend- 
ing Q (5.12). 

Section 7 contains two metamathematical digressions: by means of the 
Gédel negative translation it is explained why R* behaves nearly “classi- 
cally”, and it is shown how the distinctions between R*, R® and R° can be 
made visible in Kripke models for HAS. 


1. Introduction 


1.1. Integers and rationals. Given the natural numbers, the introduction of 
the integers Z and the rationals is unproblematic and can be carried out 
just as in traditional mathematics. 
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The elementary theory of arithmetical operations on Z and @ does not 
really carry us beyond arithmetic HA. To see this, note that we can 
enumerate Z and Q without repetitions, i.e. we can use natural numbers as 
codes for elements of Z and @. The operations +,- on Z and Q respec- 
tively correspond to certain arithmetical operations +,, -7 and +9,°Q 
which, if we have chosen our coding appropriately, are in fact primitive 
recursive; their basic properties such as commutativity, associativity etc. 
can all be proved in arithmetic. 

The coding permits us to think of the basic theory of Z and Q as 
formulated in a definitional extension of intuitionistic first-order arithmetic 
HA. Obviously, then, we can appeal to the Gédel-Gentzen translation (cf. 
3.3.4) to see that all the usual identities and inequalities are provable 
classically iff they are provable intuitionistically, for example the triangle 
inequality on Q: 


Vr, s © Q(|r + s| < |r| + [s]). 


The reader who worries about the introduction of equivalence classes in the 
traditional definitions of Z and Q@ should take into account that the 
equivalence relations under consideration are decidable and that hence 
the forming of equivalence classes is unproblematic. One can also avoid the 
introduction of equivalence classes by sticking to the original objects (pairs 
of natural numbers in the case of Z) and letting the defined equivalence 
relation play the role of equality; this is the solution chosen by Bishop 
(1967) for example. 

The coding of Z and Q in N, as indicated above, avoids these problems 
altogether, at the expense of a more involved explicit definition of the 
arithmetical operations on Z and Q via the codes. 

For all practical purposes we will stick to the traditional conception of 
integers and rationals as equivalence classes. 


1.2. Introducing real numbers. In classical mathematics there are several 
methods for introducing the reals. The two principal ones are: 

(a) Real numbers as equivalence classes of fundamental sequences i.e. 
Cauchy sequences of rationals; this method goes back to G. Cantor. The 
method of nested intervals is closely related; the filter completion method is 
more general, but in the same spirit. 

(b) The method of Dedekind cuts in Q. Here reals appear as cuts, i.e. 
subsets of Q with special properties. The axiomatic variant of this method 
does not define reals as cuts, but instead postulates closure of the number 
system under the least-upper-bound principle. 
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Method (a) generalizes to the well-known procedure for constructing 
metric completions of metric spaces, method (b) to the construction of 
order-completions of lattices. In the context of constructive mathematics, 
both (a) and (b) can be utilized. (a) can almost literally be copied from 
classical treatments, in the case of (b) some extra attention has to be paid to 
the formulation. 

So far we have not mentioned the infinite decimal fractions, a very 
special case of method (a). The reason is that this method is too narrow. 
Already Brouwer (1921) observed that not all Cauchy reals have decimal 
expansions. Even worse, the set of reals representable by infinite decimal 
expansions is not closed under addition and multiplication (cf. E5.3.10). 

Most of the introductory accounts of intuitionism, BCM and CRM use 
method (a). However, ‘there are good reasons to study both methods, as we 
shall see later. 


2. Cauchy reals and their ordering 


The introduction of the set of reals R via Cauchy sequences of rationals 
(fundamental sequences) is quite similar to the classical procedure, once we 
have chosen our initial definitions appropriately. 


2.1. CONVENTION. Throughout this chapter we shall reserve n, m, i, j, k 
for natural numbers, r,s, ¢ (possibly with sub- or superscripts) for ra- 
tionals, a, 8, y for (infinite) sequences of natural numbers, and x, y, z for 
reals. 

For sequences ¢o,t),5,... (i¢.- functions on N) we shall use the 
“mathematical” notation (1,,), instead of the more “logical” An.t,. O 


2.2. DEFINITION. A fundamental sequence is a sequence (r,,),, of rationals, ’ 
together with a sequence B, its (Cauchy-) modulus, such that 


Vkmm'( Weim ~ "ek+m'| < 2-*). 
Two fundamental sequences (7,,),,(S,), are said to coincide (notation ~) 
if 

VkANWM( Fam — Sntm| < 27*). 


The set of Cauchy reals (or reals for short) R consists of the equivalence 
Classes of fundamental sequences relative to =. Observe that, as in the 
classical theory, = is indeed an equivalence relation. For the equivalence 
class of(r,),, we use the standard notation (7,),/=. O 
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REMARKS. From this definition it will be clear that the extent of R depends 
on the extent of N, the set of sequences of natural numbers. For a smooth 
theory, N™ should obviously satisfy some closure conditions. A very weak 
requirement, sufficient for the developments in the first few sections (2-4) 
of the theory of Cauchy reals, is AC-NN! (also denoted by ACgg!) 


AC-NN! — WndlmA(n, m) > JaWnA(n, an). 


In fact, even Jess suffices at most places. We shall return to this issue in 
4.6-7. 

Again the introduction of equivalence classes is not essential, cf. Bishop 
(1967), but it is certainly more convenient and closer to mathematical 
practice. 

If we can assume that our sequences satisfy the countable axiom of 
choice (cf. 4.2.1) ACg, we can simplify our definition of fundamental 
sequence to the more usual 


VkAnW mm (|Py 4m — Tram'| < 2-*). (1) 


In general, the whole theory simplifies considerably if we adopt ACg. 
However, in this chapter we shall not assume it unless explicitly indicated. 
This will have the advantage of making the reader more aware of the role of 
ACog in the development of the theory, and will permit us to apply the 
theory of this chapter in contexts where ACg) cannot be assumed (cf. 
chapter 15). 

At first sight one might expect that =~ also ought to have been defined in 
the explicit manner: (r,), = (Sn)_ = JYVAVM( |r 4m — Sykeml < 27%). 
However, we get y for free, as may be seen from the following 


2.3. PROPOSITION. Let (1), (Sn) be fundamental sequences with Cauchy 
moduli a, B respectively, and assume (r,,),, ~ (S_)n- Then 


Vkm(|r Nkim _ Sokcin| < g-*) 
for yk = max{a(k + 2), B(k + 2)}. 


PRooF. Let Wm(|ryim — Snam| < 2 *"'). Take an arbitrary meEN. If 
yk =n, we are done; so let yk <n, then yk + m’ =n for some m’, and 
Ir. yktm Sop | Ss Ir. yktm Nieew'l + Wyte m’ eS Syk+m'| + Sytem ome Syktm'l 
e grkri pHa g- bP 27k 
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2.4, DEFINITION (of the ordering relation on fundamental sequences). 
Let (7,),,,(5,), be fundamental sequences, and put 


CA < <5. = BknVm (Sim aie liam > 2=*), Oo 
2.5. PROPOSITION. < between fundamental sequences is compatible with = , 
i.e. 

ca < css A {r) oad ieee A ee i 2mm = Ce < Siac 
Proor. From the definition it follows that there are k and n such that 


Vm(s a Th+m > 2%), Vm (lnm ~ Tn+ml < i?) 


n+m 

Vm(Snim — Seaml <27*-7); hence Wm(s) 9 — Mam > 274). O 
2.6. DEFINITION. For r € Q define r* as the fundamental sequence (r,), 
with r,=~r7 foralln. O 


REMARK. r ‘+> r* is an order preserving embedding from Q into R, Le. 
r<sert<s*, 


2.7. DEFINITION of <, <, #¥ onR. 
X <= ry )y SX A Sy)n © (Clan < (Sudn)s 
xsy=Hly <x); 
x#y=(x<y)V(y <x). 
As usual, >, > are taken to be the relations converse to <, <. # is 


called apartness, x # y is read as “x is apart from y”. O 


Some authors, such as Bishop (1967), write # for # ; however, since 
“__x =y” is also used fairly frequently in constructive mathematics, we 
have reserved # for inequality. 

Warning: x < y is not used for x < y V x = y! Usually x < y is pro- 
nounced as “‘x is not greater than y” which is rather awkward. Perhaps one 
could use “x is down y”. 


2.8. PROPOSITION. Let (1,),, © X;(S,)n © Ys x, Y ER; then 
x # y @ AknWm(|Saim — lneml > 27*). 


Proor. Exercise. O 
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Apartness is a positive version of inequality which plays an important 
role in constructive mathematics. Its basic properties are given by the 
following 


2.9. PROPOSITION 

API aX#yYOx=y, 

AP2 x#yoy #x, 

AP3 x #y>W2(x #2Vz#y). 

Proor. AP1l: x = y ~ +(x #y) trivially follows from the definition of 


# , so it remains to show +(x #y) > x =y. Let Hx # y),(S,), EX; 
(tan = Y, then 


Vkn Wm (|Spim — fneml > 27"). (1) 
Also, if a, 8 are the moduli of (s,,),,,(1,), Tespectively, we have 

vknm (Seren Sekeml <2"), (2) 

Vknm(Ite+n — teeaml < ant): (3) 
Take k + 2 for k and max{a(k + 2), B(k + 2)} for n in (1), then 

Vin (Sn ha he?) (4) 


Now suppose that for some k and ™ |5,.4m— tn+ml > 27*, then 
=e 


Sat m! = tnt ml 2 [Seam = Leal a Sam’ a Sn+ml oe Le, neal 


> 27k =D Qoke2 a Q-k-1 
for all m’; this contradicts (4), hence Vmk(|S,.4m — 4n+m| < 27"). Hence 


we conclude that x = y. 
AP? is trivially fulfilled; we leave the proof of AP3 as an exercise. O 


2.10. COROLLARY. Equality between real numbers is stable, i.e. 

Vxy(anx =y>x=y). 
Proor. By AP1 and propositional logic x = y @ “(x # y) @ ——-(x # y) 
2 a(x = y). | 


2.11. PRoposiTION. For all reals x, y, z 
G@) x<yAysxr7x=y, 
(ii) x<yAy<z>x<z, 
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(lili) x<yox<zVz<y, 

(iv) x<yAysz2>x<z, 

(Vv) x<yAy<z>x<z, 
(vil) x<yAysz>7x¥<z, 
(vii) xs yo —(x<yVx=y), 
(viii) —-(x Sy Vy Sx), 

(ix) --+(x<yVx=ypVvy <x). 


Proor. (i) (x <yAy <x) > Ax>y)AAy > x) > Ax>yVy> 
x)? Ax #y > x =y (E2.1.1, API). 

(ii) Easy from the definition. 

(iii) x<y>mx#y, hence x#z2VzZ#y, SO (x<zVz<x)V 
(y <zVz<y). By (ii) this implies (x <zVz<y)V(x<zVz<y). 

(iv) If x < y, then by (iii) x < z Vz < y; z < y contradicts y < z, hence 
x<ypAysz2>x<z. 

(v) Similarly. 

(vi) Let x < y, y < 2; assume z < x, then z < y from x < y and (iv); 
this contradicts y < z, hence 4z < x, ie. x < z. 

(vii)—(ix) are left as an exercise. O 


2.12. Counterexamples, We recall the discussion in 1.3.5—6, and in particu- 
lar the definition of x4, (7,4),,, or equivalently x*, (7,*),, for a the character- 
istic function of A. Clearly 


x* #00 3k(ak #0), x*=0< Vk(ak = 0), so 
Ak(ak # 0) V =3k(ak #0) > x* FOV x* =0, 

and thus Vk(ak = 0) V AVk(ak = 0) ~ x* = 0 V x* # 0. Hence 
a3-PEM > —Vx(x #0 Vv x = 0), 
~V-PEM > =Vx(x #0 V x =0). 

It follows therefore from either CT, or WC-N (4.3.4, 4.6.4) that 
aVx(x #0 Vx =0). 

In particular, since x #0 @ x <0 V x > 0, we also see that 
aWx(x <0Vx>0Vx=0), 

i.e. the law of trichotomy is refutable: 


AVxy(x<yVx>yVx=y). 
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A more refined construction leads to the refutation of Vxy(x <y V y < x). 
To see this, let a, 8 be two sequences such that “(Ak(ak = 0) A 3k 
(Bk = 0)). We define x** via a fundamental sequence 7°? as follows 
0 if n3k < n(ak =0 Vv Bk = 0), 
Roe >" ifak =OAk<nAWm<k(am # 0), 
—2-* ifpBk=OAk<nAWm<k(Bm #0). 
Then 
(i) 72-8), is a fundamental sequence with modulus An.n + 1, 
(ii) x8 > 0 & Jm(am = 0), 
(iii) x8 < 0 @ Im(Bm = 0), 
and therefore 
xB <0 V x%8 > 0 @ —dm(am = 0) V —3m( Bm = 0). 


If we can find particular a, 8 such that —(Ak(ak = 0) A 4k(Bk = 0)), 
while —Jm(am = 0) V —dm(Bm = 0) is unknown, we have a weak coun- 
terexample tox <0 V x20. 

For example, let Bn := Imym,(2n+2=m,+m,Am,#2#m,A 
m, prime A m, prime); so WnBn expresses Goldbach’s conjecture. Let 
A(n) be as before in 1.3.5 and fix a, B by 


am = 0 @ —Am A Wk < mBk, 
Bm =0 © =Bm A Wk < mAk. 


Returning to the generalized form, recalling that “SEP” was the statement 
VaB[A(ak(ak = 0) A 4k( Bk = 0)) — —3k(ak = 0) V AAK(Bk = 0)], we 
see that 


SEP > =VaB(x"8 <0 Vv x™* > 0) 
hence 
ASEP > =Vxy(x <y Vy <x), 
and in particular, assuming either CT) or WC-N (4.3.6, 4.6.5), we find 
Vxv(x <y V y < x) to be false. 
2.13. DEFINITION. Let (7,), € x € R, (s,), € y € R; we put 


x ae y = {T, + Spud = ? x = = At, a Sadn/ ® > |x| = (nln/ * a 
Oo 
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Of course, we have to verify that this is a correct definition, that is to say we 
have to show that ¢|r,]),. (7, + 5,), and (7, — s,), are again fundamental 
sequences, and that 


Tadn = (adn A (Sdn = (Sad > (Wada = (ala A 
Tn ay eB es ce + Les A ee sa? Sy) 7 on ins Shae 


which is straightforward, and left to the reader. 
It is virtually immediate that for allr € Q,x ER 


r>O>-x<xdtr. 
The following proposition gives a convenient “positive” characterization 
of <: 


2.14. PROPOSITION. For x, yER 

x sy owk(x<y+2>*). 
Proor. If x < y, we have x < y+2>* for all k (by y<y+27* and 
2.11(v)). Conversely, assume Wk(x < y + 2~*), and let x > y, then for 


some m, x > y +27” (why?). Since also x < y +27”, a contradiction 
follows. Sox<y. O 


2.15. PROPOSITION. Let (r,), Ex ER, y,zER andreQ, 
(i) If Vm nr, — Tyaml < 27%), then |x — r,| < 27%. 

(ii) |x -—yl<riffx-rs<y<xtr. 

(iii) If |x — y| < rand |y — 2z| <r’ then |x -—z|<rt+r’. 


PROOF. (i) is left to the reader. 
(ii) Suppose (s,,), € y, then 
jx -—yl<sreVn(|[x -yl<rt+2")o 
ViaknWm(r + 27! = Iam — Sram > 27‘) @ 
WiaknV mm (tyam — 1 2+ 2 < Spam < Tram tt 27'- 274), 
which is equivalent to 
Vi(x —r—2!<y<x+rt+27); 


this is equivalent tox -—-r<y<x+r by 2.14. 
(iii) is left to the reader. DO 
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REMARK. This proposition gives just a bare minimum of properties of 
Ax, y.|x — y| which suffices for the time being; the next section presents a 
quite general and uniform treatment of “arithmetical” operations on R 
(such as +, -,max, min, Ax, y.|x — y| etc.). 


2.16. Markov’s principle. Obviously, the theory of real numbers could be 
considerably simplified if we would know that 
Vx(x #0>x #0). (1) 


In traditional intuitionism (1) is rejected, and accepted in constructive 
recursive mathematics. Markov’s principle is equivalent to (1), as was 
shown in 4.5.4. 


3. Arithmetic on R 


3.1. DEFINITION. Let x, y © R” or x, y € QQ", r © Q, and let 0 abbreviate 
the element (0,0,...,0) in R”. Then 
Ix-yl<rs=Vill <i<n-|x,-y| <r), 


IX] <re=|[X¥-—Oj<r:=Villsi<n->|xJ|<r). O 


3.2. DEFINITION. Let f € X > Z, where X and Z are Q” or R”. We put 
M,(m, i,k) = VX,¥ © X"(|X| < mA |] < ma 


[x —yls 27 > /f(%)—/O)I s 2), 
M,(i, k) = VmM,(m, i, k). 


f is uniformly continuous with modulus a iff VkM,(ak,k); f is locally 
uniformly continuous with modulus a if VmkM,(m, a(m, k),k). O 

On assumption of ACy, these notions may be equivalently defined 
respectively by: f is uniformly continuous iff WkAiM,(i, k), and f is locally 
uniformly continuous iff Wmk4iM,(m, i, k). 


3.3. REMARK. The most obvious definition of a function from R to R is via 
graph predicates: a function from R to R is given by a relation RC R XR 
such that 


Vx € RI!y ERR(x, y); 
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the function f described by R satisfies Vx € R R(x, fx). For uniformly 
continuous f, f is already determined by its restriction fp = ft Q. On the 
assumption of 


ACgi Vx4daA(x,a) > JBVxA(x,(8),), 


where (8), = Ay.B(x, y), fo is determined by some a © N > N such 
that 


Vinge oa) Yn = f(7,)), 


where (7,), is a standard enumeration of Q without repetitions. Presum- 
ably ACg) or ACo,! (with J!a in the premiss) are not sufficient for this 
conclusion. 


3.4. DEFINITION. Let f € Q" > Q, f locally uniformly continuous. Then 
the canonical extension f* € R” > R is defined by 


f*(%) = ((ST))n) « 


for any (7), EX. O 
We leave it as an exercise to show that /* is well-defined. 


3.5. THEOREM. Let f with a modulus be as in 3.4, then f* is the unique locally 
uniformly continuous function with modulus extending f. 


PRooF. For notational simplicity we shall assume f to be unary, and 
uniformly continuous (instead of only locally uniformly continuous); let « 
be a modulus for f. We first prove that f* is uniformly continuous with 
modulus a. Let (7,), € x, (5,), EY. X-— 2 <p <x +27%, and as- 
sume 


f*(x) > f(y) +208 (1) 
From (1) it follows that there is an ” such that 
Ve( f(t +m) — (Sram) aa ae > 0). 


Since |F, 4 — Spa ml < 2>% > WP tuam) —L(Sn+m)| < 274, it follows that 


—ak 
Vm (nim = Sniml > 2 : . 


and therefore (cf. 2.8) x < y V y > x, Le. we can now derive that for some 
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n’>n 
WM (Pym ~ Sa'em > 27%) VOM (Sam — arem > 27%), 


Thus we find (x — 27% >y) V(y—27*% > x), which conflicts with 
x—-2-* <y, y—2°-% <x; thus (1) is false and therefore f*(x) < 
f*(y) + 27%, and similarly f*(y) — 27* < f*(x). 

As to the uniqueness, let f° be any other uniformly continuous extension 
of f, say with modulus £. If (r,), © x, and for some n 


Vem (Fyn ome Taal < 2%), 


where yk = max(ak + 1, Bk), it follows that |x — r,| < 27~”*, so |x — r,| < 
27k Ix — r,| < 27F*, and therefore 


YO raia2™,. i =—ools2-% 
Thus | f*(x) — f°(x)| < 27**!. Since this holds for all k, we find f*(x) = 
f(x). O 


COROLLARY. The operations +,-,| | (absolute value), max, min, Ax, y. 
|x — y| can be extended from Q toR. O 


3.6. PROPOSITION. Let (X,,...,X,) = 0 express any algebraic ( poly- 
nomial) identity in +,-,| |,max,min, etc. which holds for all (x,,.-.,X,) 
€ Q"; then the same identity holds on R". 


Proor. Consider, for example, the law of distributivity, which may be 
expressed as 


x(y +z) — (xy + xz) =0. (1) 


Noting the fact that locally uniformly continuous functions are closed 
under composition, we see that the left hand side and the right hand side of 
(1) may be viewed as two distinct ternary functions coinciding over Q; 
hence their extensions to R coincide also; thus (1) also holds over R. O 


REMARKS. (i) The practical importance of this proposition is that we can 
transfer identities from Q, which are usually much easier to prove because 
of the decidability of = on Q, toR. 

(ii) We cannot, in general, extend 3.6 to propositional combinations of 
polynomial equations: it is not hard to show by means of a weak counterex- 
ample that we cannot show xy = 0 > x = 0 V y = 0 in R, though this is 
true over Q (exercise). 
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3.7. PROPOSITION. 

(i) x>yVx=y- max(x, y)=x A min(x, y) =), 
(li) x <y  max(x, y) = y © min(x, y) = x, 

(iii) |x — y| = max(x, y) — min(x, y), 

(iv) x<yextz<ytzxsyextzsytz, 
(v) |x-ylSzex-zsysxtz. 


PROOF. (i) is left to the reader. 

(ii) x <y@->A(x <y Vx =y) (2.11(vii)), hence with (i) we find 
——(max(x, y) = y), and by the stability of = , max(x, y) = y. Conversely, 
if max(x, y) = y and x > y, we have by (i) max(x, y) = x, hence x = y 
contradicting x > y; therefore x < y, etc. 

(iii) is immediate by the corollary in 3.5. 

(iv) we leave to the reader. 

(v) This property generalizes 2.15(ii). Observe that 

xsy>|x-yl=y-x, 
xByoh—yl=x-y, 
and since 
xsyr(y-x<zex-zsysxtz2), 
ysxro(x-yszex-zsysxtz), 
we have 
x<yVx2zyo(lx-ylszex-—z<y<xtz). 
Now —-(x < y V x = y) holds, therefore by propositional logic 


aa(|x -— yf Sz) @ aA(x-zsyAysxtz), 


from which (v) follows, using propositional logic and the stability of <. 
o 


3.8. COROLLARY. Let 9(X), p(X) be two polynomials (in +,-,| | etc.) such 
that p(X) < W(X) holds on Q"; then this inequality also holds on R". 


PROOF. 9(X) < (X) is equivalent to the polynomial identity 
max((x), ¥(¥)) = ¥(¥). O 
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3.9. PROPOSITION. 
(i) min(x, y) < x < max(x, y), min(x, y) < y < max(x, y), 
(ii) |x + y| Ss |x} + [yh [x — yl] = | 1x1 — [yl he 


Proor. Immediate by the preceding corollary. O 


3.10. PROPOSITION. 

@) x#yroxt+z#ytz, 

(Gi) xt+y#0>x#0Vy #0, 
ii) xy =1lL->x#OAy #0. 


PROOF. (i) Immediate, e.g. from 3.7(iv) and the definition of # . 

(ii) By (i), x + y # 0 entails x # —y, so by AP3 x #0 V —y # 0, hence 
x#O0Vy #0. 

(iii) Let (7), € x, (S,), © y, and let xy = 1. There is an i such that 
Vn((r,| < 2‘). Determine k such that Vm(|ry45¢4m — 1| < 271). Assume 
that for some m’ |s,, |< 27°}, then |ry4—Spam'l< 271 and hence 
lx m'Sktm’ — 1| > 27}, contradiction. Therefore Vm(|s,.,,| > 27/71), ie. 
y #0. Similarly x #0. O 


3.11. REMARKS. (i) Theorem 3.5 can also be applied to segments of Q, 
as will be clear from the proof. In particular, for any segment [r, r’] 
= {s:r<s<r’} such that 0 €[r,r’] the inverse \x.x~! is defined and 
locally uniformly continuous. Therefore an extension of Ax.x7! to R is 
uniquely defined for {x : x # 0} and satisfies x - x~' = 1 (by an argument 
as in 3.6). As follows from 3.10(iii), x 0 is a necessary and sufficient 
condition for the existence of an inverse x~! such that x- x7! =1, ie. 
with an existence predicate E 


Ex-lox #0. 


Here we encounter our first natural example of a function which is only 
partial on R. In classical mathematics, we can avoid the consideration of 
partial functions by an ad hoc device, in this case by stipulating 


0-1 =0. 


Intuitionistically, this only makes x~! defined on {x: x = 0 V x # 0}, not 
on the whole of R. We leave it as an exercise (E5.3.5) for the reader to show 
that Ax.x~! cannot be extended to a total function on R. 
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(ii) For all fundamental sequences (r,,),, we can define ((7,),)7' = (Sn)n 
by 


0 ifr,=0, 
Se Vol itr £0. 


Then, if (r,), #0, it follows that ((7,),,)~/ is a fundamental sequence and 
an : (tn) sa (t ‘ cs ford 1. 

(iii) More generally, 3.6 also applies to polynomials in functions f defined 
on segments {r: ry <r <vr,} of Q. This permits us to lift also arithmetical 
identities involving ~! from Q@ to R, whenever the expressions involved are 
defined, noting that ~+ is uniformly continuous in a neighbourhood of each 
point where it is defined. 

Once we have introduced R with its ordering and the principal arithmeti- 
cal operations, we can define many other mathematical notions in the usual 
way, such as metric space, limit point étcetera. We give some examples 
below. 


3.12. DEFINITION. A metric space is a pair (V, p), V a set, p a mapping 
from V X V to R, such that for all x, y,z EV 

() p(x, y)=0ex=y, 

(ii) p(x, y) = p(y, x), 

(iii) p(x, y) < p(x, z) + p(z, y). 

The elements of V are called the points of the space. O 


3.13. DEFINITION. If (V, p) is a metric space, x € V, a © R, then the open 
ball of radius a with centre x is 


U(a,x) = {y: p(x, y) <a}. 
Let Wc V. The closure W~ of W is 

W-:= {y:Wr > 04z € W(z € U(r, y)). 
x © V is an accumulation point of W iff 

xe (W\ {x}). 


Metric spaces can be given a topology with basis {U(r, x): r > 0A x € V}. 
If we talk in the future about the topology of a metric space without further 
explanation, we mean the topology defined above. O 
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In agreement with the definitions in 4.1.5 we have: W is open iff 
Int(W) = W, and the interior of W may be defined for metric spaces as 


Int(W) = {y:4r> 0(U(r, y) c W)}. 


A mapping f from a metric space (V, p) to a metric space (V’, p’) is 
uniformly continuous iff 


Vx > 0Ay > OVu,v € V(p(u,v) <y > p'( fu, fv) < x). 


3.14. DEFINITION. Let (V, p) be a metric space, (x,,), C V, x © V. 
{Xn n IS Said to converge to x (relative to the metric p) with modulus y 
(the modulus of convergence) iff 


vkn( p(x, Pere’) < arhy. 


Then x is said to be the /imit of (x,,),, notation x = lim(x,),. O 


3.15. EXAMPLES. R itself is a metric space with metric given by 
p(x, y) = |x — yl. 

Similarly, R” can be metrized by 
p(X, ¥) = max |x, — Jil 

In R, x is the limit of (x,),, iff for some y 
Vkn (|x at fe ay 


If we take all subsets of X as the open sets of X, X is said to be provided 
with the discrete topology. Discrete topological spaces are usually not 
metrizable, but the metric induced on N as a subset of R yields the discrete 
topology forN. O 


4. Completeness properties and relativization 
In this section we shall show that R is Cauchy-complete (metrically 
complete), but not order complete; we also discuss the relativization of the 


theory to suitable universes YW of sequences. 


4.1, DEFINITION. (x,,), is said to be a Cauchy sequence of reals with 
modulus (a, B), a EN? > N, BEN > N if each x, is given by a funda- 
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mental sequence (7, ,,)., With modulus (a), = Am.a(n, m), and where 


Vkmm'(|Xpkm — Xpeeml <2°*). O 


REMARK. On assumption of ACo ) we can simply define (x,), to be a 
Cauchy sequence of reals iff 


VkAnWmm'(|Xj4m — Xnam'l < 27*). 


4.2. THEOREM (Cauchy completeness of R). Each Cauchy sequence of reals 
(X,)» converges to a limit x. 


Proor. Let (a, 8) be a modulus for (x,,),,, and let y be given by 
yO = BO, y(n +1) = max(B(n + 1), yn) + 1; 


then (a, y) is also a modulus for (X,),, We put 5, °= Tyyn,n), yn Now 
[Xn — S,| < 27", and thus, since y(n + m) and y(n + m’) are not smaller 


ntm Sn+ml 


SB Sysa aml eae eae) Ral See we taal Tee Te | 
se ee ee ea a 
Therefore (s,),, represents a real x, and (x,), converges to x with a 
modulus An.y(n + 3): 
IX ~ Xy(n43)4 ml 
= [X= Syaal th Sacg = Ayre al ygiday = Feel 


<r l a 2-7-3 of 2727-3 <2". 0 


N.B. If we adopt the simpler definition of Cauchy sequence (4.1, remark), 
the above argument itself requires ACgg. 


4.3. DEFINITION. (x,), is bounded, if for some m EN VWa({x,]| < m). 
{X,)q iS monotone, non-decreasing if Vnm(n < m > x, < X,,), and mono- 
tone, non-increasing if Wnm(n <m— x, > X,,). A sequence is monotone 
iff it is monotone, non-decreasing or monotone, non-increasing. Replacing 
<,2 by <,> respectively yields the notions of monotone increasing, 
monotone decreasing and strictly monotone sequence. O 
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4.4. Bounded monotone sequences and the Bolzano—Weierstrass theorem. 
The Bolzano-Weierstrass theorem states that every bounded infinite set of 
teals has a point of accumulation; a well-known consequence is the fact 
that a bounded monotone sequence of reals has a limit. Constructively this 
is false, as follows from the following 


PROPOSITION (43-PEM). It is false that every bounded monotone sequence in 
N > Q has a limit in R. 


PrRooF. Define (7,7), by 


a 
n 


1-2°" if ak <n(ak = 0), 
2-2°-" if ak <n(ak = 0). 


Assume lim(r°), = x © R, then x < 2 or x > 1, and thus 44k(ak = 0) V 
4k(ak = 0) for all a; this contradicts ,3-PEM. O 


+J-PEM is a consequence of either CT) or WC-N (4.3.4, 4.6.4). However, 
CT, permits a much stronger sort of refutation which is typical for 
constructive recursive mathematics; namely we can show, assuming CTy, 
the existence of a bounded monotone sequence without limit. 


4.5. DEFINITION. A bounded monotone sequence (7,,),, without a limit is 
called a Specker sequence. O 


THEOREM (CT,). A Specker sequence exists. 


PRroor. Let f be any injection of N into N with non-recursive range (e.g. 
let f enumerate {x:dy7xxy} without repetitions) and consider (r,), 
defined by 


k 
n= DIK. (1) 
i=0 


(r,),% 18 obviously bounded and monotone; now assume (7,), to satisfy a 
Cauchy condition, then for any given m we can find k such that 


War, — Peenl <2>™)- (2) 


m is in the range of f iff the sum of one of the ns ends with 2~”. So 
determine k with Wn(|r, — %4,| << 2~™), then we only have to compute 
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the first kK values of f in order to test if m belongs to the range of f. Hence 
the range of f is decidable, and therefore, by CTy, recursive. Contradiction. 
Oo 


Later, in 6.4.8 we shall give a proof of a much stronger statement, namely 
the existence of a strong Specker sequence, i.e. a bounded monotone 
sequence staying apart from each specific x € R. 

Note that we have also shown, in 4.4, that R is not order-complete: not 
every bounded set of reals has a supremum (least upper bound). (Take the 
sets {72> n © N}.) 


4.6. Relativizing to a universe. The theory we developed so far is obviously 
correct if we take as our basis the most general notion of sequence of 
natural numbers, i.e. the variables a, 8,... range over all “legitimate” 
(constructively, intuitionistically acceptable) sequences. In this case, ACgo 
and AC,, seem to be intuitively justified, which simplifies the theory. 

The same can be said, if we take as our basis the notion of lawlike 
sequence or choice sequence. 

More generally, we may attempt to relativize the notion of real to some 
universe Y of sequences of natural numbers; for a smooth theory of reals Y 
will have to satisfy certain closure conditions. 

More specifically, let (7,,),, be a fixed enumeration of Q. Relative to this 
enumeration, we may consider @fundamental sequences (7,,,),, with mod- 
ulus 8, where a and f belong to &. 

Inspection shows that the positive theory of <, < is indeed applicable 
to @&reals, since it is a theory of relations, not of operations. Closure under 
specific operations, however, such as +,- requires also certain closure 
conditions on %; also, in our discussion of counterexamples and Markov’s 
principle certain closure conditions on Y are implicit, since we appeal to 
AC op!. 

At first sight it might seem as if the property: x < y @ Wk(x < y + 27*) 
implicitly appeals to closure conditions on &%, since if y is a real, 
y+27* need not be so unless Y satisfies certain closure conditions. 
However, if we let Y@* be the universe generated by W by closing off under 
certain suitable, explicitly given closure conditions, we can guarantee that if 
y is a Y*-real, then so is y+2~*; and for @reals x, y the property 
x <y © Wk(x < y + 2~*) is nothing else but a characterization of < on 
the #reals via < for %*-reals. 

Note that if we choose another fixed enumeration of the rationals, say 


{Ann With 9, = Ty», We may get another notion of @real; requiring 
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isomorphism w.r.t. <, on assumption that » € Y, again imposes certain 
closure conditions on %. 


4.7. Axiomatic basis. A detailed inspection shows that the theory of R we 
developed can be axiomatized in the language of EL (elementary analysis, 
cf. 3.6.2) with the following very weak axiom of choice 


QF-ACy) WndmA(n,m) > AaWVnA(n, an) (A quantifier-free). 


QF-AC,, as an assumption on Y expresses nothing but the fact that & is 
closed under “recursive in”. 


5. Dedekind reals 


In this and the following section we give a separate and independent 
treatment of Dedekind reals because (1) they illustrate the difference 
between the Cauchy-completion and the order completion of Q, and (2) 
Dedekind reals play an important role in sheaf models for analysis in which 
ACoo fails (cf. chapter 15). 


5.1, DEFINITION. Let S C Q. S is called a weak left cut (weak cut, or w.l.c. 
for short), or (bounded) extended real if 


(a) dr(r € S), ds(s € S) (boundedness), 

(b) Vrs(r <sA45sES~>reS) (strong monotonicity), 
(c) Vr © Sar’ € S(r <r’) (openness). 

If (a) is weakened to 

(a’) —49r(r € S), nAds(s € S) (weak boundedness), 


we obtain the weakly bounded cuts or classical reals, 

S is called a left cut (or cut) or Dedekind real (in this section simply 
called “‘real”) if S satisfies in addition to (a), (b), (c) 
(d) Vrs(r<s->reSvs€S) (locatedness ). 
The collections of Dedekind reals, extended reals and classical reals we 
denote by R4, R™ and R¢ respectively; obviously 


RICR’ CRE. 
Equality between weak cuts is simply equality between sets of rationals. O 
REMARK. (d) implies (b): let r <r’, sar’ € S; sincere S Vr’ ¢€S, we 
see that r © S. Therefore Dedekind reals may also be defined by (a), (c), (d). 


One often finds cuts defined as pairs of sets of rationals (cf. exercise 
5.5.3), but using left cuts we find technically less cumbersome. 
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Dedekind reals can be approximated by rationals ~ this is the content of 
the following proposition. Classical reals can be approximated “up to 
double negation”. 


5.2. PROPOSITION. 
(i) WaVS € R43r & S3r’ € S(|r — r'| < 27>”), 
(ii) VaWS € R°3r € S3r’ & S(\r — r’| < 27"). 


Proor. (i) Let S€R4, re S, t¢S and choose r=nm<r<rn-:: < 
r, = t such that r,,, —17,<27"7!, for i < k. By the locatedness of S we 
have 


Vi<k(r,ESV74,,€S), (1) 


and thus there exists a finite sequence my,..., m,_, © {0,1} such that 
m,=0>76ES, 
m=1>7,,€S8. 


Let j be the least i such that m, # 0, if existing, j = k otherwise. If j = k, 
we have r,_, © S, while it is known that r, ¢ S, hence (i) holds. If j = 0, 
we have ~ © S, r, € S, and again (i) holds; finally, if 0 <j <k —1, we 
have r,_, € S, 74, € S, and |r.) — 44) < 27". 

(ii) Let now S € R*. Then 


Wr'(r <r!’ > (rE S Vr €éS)). (2) 


For if r<r’, réS, ar’ € S we obtain from r <r’ A Ar’ © S with 
property (b) of (5.1) that r © S, contradicting r ¢ S; hence —+(r € S V 
r’ € S). Now assume 


res, t€58, (3) 


and choose rp,..., 7, as under (i); the proof under (i) shows that from (1) 
and (3) we find 


ar € Sar’ € S(|r—r'[< 27"); 
therefore, since (2) implies 

Vi<kA(4,ESV7,,€S8) 
which is equivalent to 


aoiVi < k(r,ESV 74,8) 
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(by repeated use of (4 A B) @ (—4A A = B)), we find 
— dr © Sar’ € S(\r-—r'|< 27"). 


This follows from (3), hence from 4r(r € S), 4t(t € S), and therefore also 
from ——4Jdr(r € S) and ——31(t € S), which hold for any S € R®. O 


5.3. PRoposiTion. Let S$ C Q. Then S € R‘ iff S is strongly monotone, 
open and satisfies 5.2(i). 


Proor. The direction from left to right has already been proved. Con- 
versely, assume 5.1(b), (c) and 5.2(i); we have to show 5.1(d). If r<s, 
determine r’ € S,r” € S, |r’ — r”|<s—,r, then either r’ <r, sor” <5 
and hence s € S (5.1(b)), or r’ > r, and then r € § (5.1(b)). O 


5.4. DEFINITION. 
S<T=SCT, 
Str={rt+r:r'ES} 
S<T:=3r>O0S+r<T). O 


5.5. PROPOSITION. For S,T € R4 
S<Teas(seTAs€S). (1) 


PRooF. Exercise. O 


REMARK. As the proposition shows, it is possible to use (1) to define < on 
R°. We might be inclined to think that we could also use this definition on 
Re or R°; however, the resulting notion is too strong, at least if we insist at 
the same time on the usual properties of < in relation to addition. For we 
want Wn(S < S + n~'); however, on definition (1) this implies S € R°, 
since if se § + n™1, s € S, we see that s — n~! © S, s € S, and since we 
can find such an s for all n, it follows that S is located (5.3). 


5.6. PROPOSITION. For all S, S’, S”,T © R® 
@ S<SAS' <S" —>S<S"; 

ai) S<S'AS’ <8" —>S8§<S"; 

(iii) S< Te-+(T<S); 

(iv) SS TATSS>S=T,; 

(vy) S<S'AS' <S" 9S <S". 
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Proor. Exercise. O 


(iii) shows that the relation between < and < is the same as for the 
Cauchy reals. 


5.7. THEOREM. R°,R°° are order-complete, that is to say 

(i) any inhabited, weakly bounded subset of R* has a supremum (least upper 
bound) in R°*; 

(ii) any inhabited, bounded subset of R°® has a supremum (least upper 
bound) in R*. 

Here a set CR is (uniformly) bounded iff Irr'VXEX(rExXxa 

r’ € X) and weakly bounded iff it is not not bounded (= not unbounded ). 


Proor. The proofs of (i) and (ii) are completely similar. If 2% is any 
inhabited subset of R°, put 


S = Int((U{x: xe #})"). 


We leave it as an exercise for the reader to verify that S is in fact the 
desired supremum. O 


REMARK. In 5.12 we shall show that R‘°,R® can in fact be uniquely 
characterized as the order completions of Q, w.r.t. weakly bounded and 
bounded sets respectively. 


5.8. PROPOSITION (definition by cases). Let A be an arbitrary assertion, and 
let So, S; © @ be such that if A, then Sy © R*, and if A, then S, © R*, 
then 

(i) there is a unique S € R° such that 


i: if A, 


S, ifA; (1) 


(ii) if it is known that S,, S, © R°° (independently of the truth of A,—A), 
then there is a unique S © R°* satisfying (1). 


PROOF. Take 
S = Int{r:((r € Sy A A) V (re S, A A))}. 


We leave the verification as an exercise. 0 
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In the next subsections we show that R and R¢ are isomorphic; AC-NN 
suffices. 


5.9. DEFINITION. S C @ is called a strong Dedekind real if S satisfies 
boundedness (a), openness (c), and the following strong form of locatedness 


(dV) Ja € Q?oNVrr'(r <r’ > (a(r,r’) =O0ArES)V 
(a(r,r’) #0Ar ES)). 


In the presence of ACyg9, a Dedekind real is always a strong Dedekind real. 
Oo 


5.10. PROPOSITION. The collection of strong Dedekind reals with < inherited 
from R° is order-isomorphic to R, by an isomorphism which is the identity 
on Q. 

Therefore, assuming AC, R and R® are order isomorphic via an isomor- 
phism which is the identity on Q. 


PRooF. Given any Cauchy real x, (r,), © x, we put 
p(x) = Int{r:dkVn(r,4, >7)}. 


Then (x) is a strong Dedekind real and leaves all rationals fixed. 

Conversely, let S be a strong Dedekind real, with a as required by (d’). 
Assume p € Z to be such that p € S, p + 1 € S. Choose a k EN, and let 
r,=pt+i-2-*, i< 2*+1. Without loss of generality we may assume 
a(7, 7) = 9; obviously a(p + 1, p + 1+ 7r) #0 for all r > 0. 

Put s, = 7; for j = max{i: a(r;, 7,,1) = 0}. Then 7, © S, 7, € S, hence 
|S — s,| < 2~*—1. Therefore (s,), is a fundamental sequence; let y be the 
mapping assigning (s,),/= to S. It remains to be shown that gy, y are 
each others inverse and preserve <j; we leave this as an exercise. O 


The next proposition shows how already a quite weak non-classical 
principle, namely .VP(4P V —4P), enforces that R¢ and R* do not 
coincide. So, if we assume ACy) and —VP(=P V ——4P), then the Cauchy 
completion of @ (i.e. R) and the order completion of @ (ie. R°*) are 
non-isomorphic. This shows that the two completions are quite different. 
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5.11. Proposition. VP(4P V —AP) is equivalent to each of the following 
assertions 


R= Ro, 

af € R°°>R4 (f is non-constant), i.e. 

af e R’°>R43x E R&Ay E R™( f(x) # f(y)). 
Hence 

«VP(4P Vv =P) > Vf E R™ > R4(f is constant). 


Proor. (i) Assume VP(4P V =P), and let S E R**, r <5; take r’ with 
r<r’<s, and by assumption ar’ € S Vv Ar’ € S; if ar’ © S, then 
s€S; if —Ar’ © S, then re S. Hence Vrs(r<s>reEeSvsE€S8), so 
S € R4, and thus R’ = R4, 

(ii) Assume R* = R4, then Ax.x is a non-constant function from R°¢ 
to R4. 

(iii) Suppose x, y € R°*, f(x) # f(y). By 5.8 we can define, for each 
proposition P, a number zp © R® such that 


_ fx ifaP, ° 
7P\y if. 


By the properties of # 


VP( f(zp) # f(x) V f(zp) #f(y)) 
and thus VP(,P V ——P). O 


REMARK. —VP(4P V =P) is weaker than —V-PEM, ie. AVa(—4x 
{ax = 0) V —44x(ax = 0)), which is a corollary of either WC-N or CT) 
(4.3.4, 4.6.4). O 


We now turn to the characterization of R4,R°*,R® as ordered structures. 


5.12. THEOREM (characterization of R4,R*,R° as ordered structures). 

(i) (R°°, <) is, up to an isomorphism which is the identity on Q, the unique 
strict partially ordered structure (X, <) extending (Q, <) such that for 
all x, y, z © X, withx <p = -(y < x): 

(a) x<zAysz2>7x<z; 
(b) xSyAy<z>x<z; 
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(c) xSyVAyYSKXPX=Yy; 
(d) x<yoadr>OVr’ <x(rt+r’<y); 
(e) drr'(r<x <r’); 
(f) every bounded subset of Q has a l.u.b. in X. 
(ii) (R°, <) is the unique strict partially ordered structure (X, <) extending 
(Q, <) such that (a)-(d) as under (i) hold, and in addition 
(e’) —A3rr’(r < x <r’); 
(f) every weakly bounded subset of R has a l.u.b. in X. 
(iii) (R4, <) is the unique strict partially ordered structure (X, <) extending 
(Q, <) such that (a)-(e) of (i) hold, and in addition 
(f”) each located, bounded downwards monotone subset of Q has a 
Lu.b. in X; 
(h) Vx © XVrs(r<srmr<xVx<s). 
Here Y C X is located iff Vrs(r<s7mreYVs€Y). 


PRroor. We shall establish (i) by proving the lemmas 5.13 and 5.14 below. 
The proofs of (ii) and (iii) require minor modifications which we shall leave 
as an exercise to the reader. O 


| 5.13. Lemma. Let (X, <) satisfy the conditions under (i) in theorem 5.12. 
Then each v & X is uniquely determined by {r: r < v}, in fact 
{rir <0) C{rip <a) ons 9’ 


for allv,v' € X. 


PROOF. The implication from night to left is immediate by property (a). For 
the converse, we first note that 


Vx,yEX(x<yoxsy). (1) 


Assume v’ < v. By (d) there is an r > 0 such that Vr’ < v’(r+7r’ < v). 
Now choose an r” such that 0 < r” < r, then 


Vr’ <u'(r’ +r’ <d). (2) 


Choosing some fixed r’ < v’, which is possible by (e), we may take r” + r’ 
for r’ in (2), since r” + r’ < v’ (cf. (1)). This yields 2-7” + r’ < v’, and by 
induction Vn(n +r” +r’ <v’). 

By (e) there is an r* such that v’ < r*, and so Va(n-r” +r’ < r*) 
which contradicts the archimedean ordering of Q@. Hence —(v’ < v), i.e. 
veuvw O 
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This lemma reduces the proof of 5.12(i) to the special case of partially 
ordered systems (2, C*) such that (i) % is a collection of subsets of Q, 
XC* Y= ar > O{x +r: x EX} C Y), (ii) the rationals are embedded 
into by rr {(r'ir’ <r}, and (iii) VX¥ E F(X = {rir < X}). c 
corresponds to <. The proof of 5.12(i) is then completed by 
5.14. Lemma. Let (%, C*) be a family of sets of rationals ordered by strong 
inclusion as indicated, and assume each bounded S C Q to have a l.u.b. in 
&; then (, <*) is isomorphic to R™. 


Proor. Exercise. O 


6. Arithmetic of Dedekind reals and extended reals 


6.1. The developments in this section run parallel to those for the Cauchy 
reals. The principal difference is that we have to do some extra work in 
proving the existence of a unique canonical extension of locally uniformly 
continuous functions. 

We define provisionally, for r € Q, S, S’ & R*° 


IS-S'|<r=S-rcS’cS+4r. 
One readily sees that 
|S-S|<srA|S'-S sr -|S-S<ertr’. 
For |S — S’|<r we can take S—r<S’<S+r. Tf S, S’ € (R°)", we 
define |S — S’] < r coordinate-wise as. in 3.1. 
The formulas M,(m, i, k), M,(i, k), and the notions of continuity, uni- 


form continuity and locally uniform continuity are defined as in 3.2, except 
that X and Z now may range over Q,R4,R™,R®. 


6.2. THEOREM. Let f€ Q" > R4%, f locally uniformly continuous on Q"; 
then there is a uniquely determined canonical extension f* & (R*)" > R4, 
given by 

f*(S) = Int{r: 3g € Sv7 © S(G’ > Jor <f(F))}, 
where (qj,-.-,4,) > (Q1.--+59,) = Vid s<isn-—>q; > q;). 
Proor. The proof is split into two lemmas. For notational simplicity we 


shall take n = 1 and assume f to be uniformly continuous, instead of only 
locally uniformly continuous. O 
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6.3. LEMMA. f* is uniformly continuous and satisfies 

Vik(M,(i, k) > Mp(i + 2, k)). 
PRoor. Assume M,(i, k), |S — S'| < 2~‘~*. We have to show 

PS) = 2 FCPS) cfs(S) 42%. 
Let r’ € f*(S’), then for some r” > r’ and some g € S’ 

vq €S(q'>qr>r’ <f(q’)). (1) 
Suppose qg” © S, q” > q—27'~*; we wish to show r” — 27>* < f(q”). 
From |S — S’| < 27'~? we get that q” — 27‘? © S’. We have either 
gq’ >qt+2°-** org-2'! <q” <q+2°'*. In the first case, by (1), 


r” < f(q” — 27*-?), hence r” — 2~* < f(q”). In the second case we can 
find a g* > q, |g* — q”| < 27'-*, q* © S’; then r” < f(q*), so 


p= 28s Hg") = Wile") = fay Sila"): 
Thus we have shown 

Vq © SWq” € S(q” >qr>r”—-2*<f(q")), (2) 
hence r’ € f*(S) + 27*, This establishes /*(S’) c f*(S) + 27* 
f*(S) — 2-* c f*(S’) is proved similarly. O 


6.4. LEMMA. Let g € R4 > R®4 be any uniformly continuous extension of f, 
then g = f*. 


Proor. Let M,(i, k), then also M,(i,k) on Q, since g extends f; by 
lemma 6.3 therefore M,.(i + 2, k). Assume 
rn€eS, nEéS, | -—n)< 27%, 


Then 7, — 2-72 ¢ Ser, +277, so [rn — S| < 27'?, hence 
If(7,) — f*(S)| < 27*, | fC) - g(S)| < 27*, and therefore 


If*(S) — g(S)| < 20-8". (3) 
But since Vk4iM,(i, k), this holds for arbitrary k, and thus f*(S) = g(S). 
Oo 


REMARK. Clearly, 6.3 and 6.4 taken together establish 6.2. In fact it can be 
shown that 


Wik(M,(i, k) > M,.(i, k)) 


(exercise). O 
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For R°*, R° we have the corresponding 


6.5. THEOREM. Let f € Q" > R' be locally uniformly continuous on Q", 
then there is a uniquely determined canonical extension f* € (R‘)" > R} 
(where i is be or e), defined by 


f*(S) = Int{r: 39 © Svq € S(7’ > G>7r<f(G))}. 


Proor. The proof follows the same track as for theorem 6.2; we have to 
make slight adaptations in the proofs of lemmas 6.3, 6.4. 


Adaptation of the proof of 6.3. If M,(i,k), |S — S'\< 2°, and r'e€ 
f*(S"), we only have that for some r’’ > r’ 


334 € S'Vq' E S'(q'> qr" <f(q')). (4) 


Now assume for g € S’ (1) as before, and deduce (2) as before. Then 
ag € SVq € S(q'>q-r”" < f(q’)) implies (2), hence from (4) we find 
=—(2), Le. 


373g € Sq" € S(q” >q>r"-2*<f(q")), 
and then r’ € f*(S) + 2~*etc. as before. O 


Adaptation of the proof of 6.4. Assuming M,(i,k) and dr, S4n€és 
(|r, — %,| < 27-2), the proof of lemma 6.4 shows (3); now from —47, € S 
ar, € S(\r, — | < 27'-?) we find —-(3), which is equivalent to (3); and 
since S € R®, —37, € Sar, € S(\n, — | < 27'7) holds, etc. O 


REMARK. The reader should note that the adaptation of the proof of 6.2 has 
a routine character: all that is needed is the assertion in appropriate places 
of some double negations, with an appeal to laws such as (A > B) > 
(“iA > ——4 8B). From a logical point of view this becomes understandable 
if we realize that the adaptation corresponds to an application of the 
(Gédel—Gentzen) negative translation to the proof of 6.2; cf. section 2.3 
and 7.4 below. 

Note also that the definition of f* in 6.5 coincides on R‘ with the 
definition of f* in 6.2. OD 


6.6. Arithmetic on R°,R®,R. Functions such as +, -,max,min, | | may 
now be extended to R4,R°,R with preservation of identities valid on Q 
as indicated before in the case of Cauchy reals (cf. 3.6); details are left to 
the reader. O 
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6.7. DEFINITION (of apartness). Let S,T € R4; define 


S#T=4k(|S-T|22-*). oO 


6.8. Proposition. For S, S’, S” € R4 
(i) S#S'OS'#HS: 

(i) S=S'o_(S #8; 

(ii) S#S’ eo (S <S)V(S' <S); 
(iv) S#S’ > (S#S")V(S’#S). 


Proor. Left to the reader. O 


N.B. Of course, the definition of # also extends to R°, but in this case the 
crucial properties (iii), (iv) cannot be proved. O 


6.9. Discontinuous functions. As noted already in chapter 1, we cannot give 
an example of an everywhere defined discontinuous f € R — R. However, 
if we permit the functions to take values in R® or R°*, we again encounter 
the phenomenon of functions total on R (or R®*, R©) which are discontinu- 
ous. For example, there is a total f € R > R° such that f(0) = 0, f(x) =1 
for x # 0. Also we can define a total f’ € R — R® such that (0) = 0, 
f(x) = x7! for x #0; however, there is no such f’ € R > R°* (exercise). 

Oo 


7. Two metamathematical digressions 


7.1. Kripke models for R. In 3.3.8-10 we illustrated the use of Kripke 
models in the study of intuitionistic second-order arithmetic by studying a 
particular example. Now we return to the same example and show that it 
can also be used to obtain some insight in the various extensions of R. 

Given some standard enumeration of the rationals, it is clear how sets of 
rationals ought to be interpreted in the Kripke models of 3.8.8, namely as 
families = {S,:k © K} of sets of rationals such that 


kk’ >5S,cCS, 
and of course 
kitreSf=res,. 
What do the elements of R4 look like in the model? The boundedness 
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condition has to be realized at the roots of the trees considered, i.e. there 
are r,s @@Q such that re Y, s¢€YFf are forced everywhere, ie. 
Vk Ee K(r€S,As5¢€¢S,). The condition of openness translates into open- 
ness of the S, at each node. 

Monotonicity is guaranteed by letting the S, be monotone at each node. 
Strong monotonicity corresponds to a more involved closure condition in 
the models: if r < s, and the nodes k’ > k where s € S,, are dense above 
k (ie. Wk" > kak’ = k’(s © S,,)) then r € S,. 

Locatedness is a very strong condition, since existential quantifiers are 
“instantly” realized at each node separately; i.e. the decisionsr © SV s € S 
for all r <.s are already made at the root and are hence not subject to 
changes at higher nodes. 

So a Dedekind real, i.e. an element of R4, is in fact represented by a 
constant family of left cuts Y= {S,: S, = S for all k € K}, with SE R4. 
Therefore R@ is, in a manner of speaking, in these models interpreted by 
itself. 


7.2. Kripke models for R°*. For elements of R°, the outcome is different: 
dropping the locatedness permits also non-constant families Y as elements 
of R°* in the model. For example, in the model over the binary tree there 
occurs the element shown in fig. 5.1. The element ¥ is specified by 
indicating S, at each node; we write r* for {s: 5s <r}. Obviously, the # 
in the picture is distinct from all “ representing elements of R@ in the 
model. On the other hand, though clearly # 1+ 0 < Y< 1, no Dedekind 
real in the model is apart from &, that is we have 


Hie VS’ ERI 0 [0,1] nak (|Y’ — A| > 2-*) 
(exercise). 


7.3. Singleton reals. It is instructive to consider also the intermediate 
possibility of the “bounded singleton reals” RS which may be defined by 


SER :=SER*™A—SER*’. 


In our models an element of R°S is a family Y of R°* such that above each 
node there is a node where Y becomes constant (i.e. becomes, locally, an 
element of R4 in the model); in short, the nodes where YE R® is forced 
are dense in the tree. Similarly one defines 


SERS:=SERSASER!S, 
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Fig. 5.1 


Fig. 5.2 shows two examples of elements of R°’. More information on 
singleton reals is found in van Dalen (1982) and Troelstra (1982), section 4. 


7.4. R© as the image of R° under the negative translation. The results in 
sections 5 and 6 show us that R° behaves in many respects just as R@ does 
in classical analysis. 

There is a general logical explanation of this fact: R° can be obtained as 
the negative translation of R® (cf. section 2.3). The setting is e.g. the 
language of intuitionistic second-order arithmetic HAS (cf. 3.8.4). We have 


SERt= 3s(sES) AAs'(s' ES)A 
Vsr(s<rArES>s ES) AVWs €SAs’ € S(s <s’). 


The right-hand side of this definition translates (modulo the equivalences 
=-5xA @ AVWxiA, a45< res < rete.) into 


4s(s € S*) A —4s"(s’ € S*) 
AVWsr(s<rAreéS®&>seS) 


AWs € S°—Aas’ © S“(s < 5’). (1) 
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That is to say, S° is weakly bounded, monotone and weakly open (a 
monotone X C Q is weakly open iff Vr € X——4r’ € X(r < r’)); of course, 
S is also stable, i.e. ~4ar € S® > r © S®, We can now prove the follow- 
ing 


PRopOSITION. Let R{ consist of the subsets S CQ, which are weakly 
bounded, monotone, stable and weakly open; then R§ is order-isomorphic to 
R° bypE RIOR‘, p | ERO RE, where 


e(S) = Int(S), p-'(S) = S*; 
@ leaves Q (essentially) unchanged. 


Proor. Routine and left as an exercise. O 


Note also that the preservation of order plus the fact that @ is left 
essentially unchanged by the negative translation has as a result that 
arithmetical relations are also unaffected by the negative translation. In the 
next subsection we shall show by an example how the negative translation 
may be applied to obtain facts concerning R°. 
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7.5. EXAMPLE. A mapping in Q@ — R4 may be represented by a binary 
relation X C @ X Q such that Fun(X) holds, where 


Fun(X) = wr(X,eR4) with X,= {r': X(r,r’)}. 
Let, as before, |X, —-X,| < 2~* express 
Na 2 CX re Xe 2 
and put 
My(i,k) = Wrr'(|r - r'| < 275 > |X, - X,] < 27), 
A, = Int{r:dr’ € YWr" € ¥(r" > r’ ere X,)} 
(Ay is arithmetical in X and Y); also 
M,(i,k) =VY,Y' ER(|¥ — ¥'| < 27! [Ay — Ay] < 27*). 
Now we can prove with classical logic 
Wk4iM,(i, k) A Fun(X) > VX © R4(A, € R*) 
AWik( My(i, k) > M,(i, k)) A Wr(X, = A,0); (1) 


this states that A, as a function of a real Y represents a uniformly 
continuous extension of f€ @—R*4 coded by X; Ay has the same 
modulus of uniform continuity as f has. 

Under the negative translation this is transformed into the corresponding 
statement for R‘, except that the premiss Vk3iM,(i, k) is replaced by the 
even weaker assumption Vk——=4iM,(i, k). Thus one establishes theorem 
6.5 for R° (minus the uniqueness of the extension). 

In fact, the uniqueness can also be established by this method, but that 
requires the extension of the negative translation to a language containing 
variables for functions in R? > R¢ (cf. Troelstra 1980, section 7). 


8. Notes 


8.1. Definition of R and R4. The material in section 2 is standard in the 
literature on constructive mathematics. Bishop (1967) and Bishop and 
Bridges (1985) prefer working with fundamental sequences with a fixed rate 
of convergence, but the difference is inessential. Note however that a proof 
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of isomorphism of the reals as defined here, and the reals defined via 
fundamental sequences with a fixed rate of convergence, requires some 
closure conditions on N — WN (closure under “recursive in” is amply 
sufficient); this becomes relevant if we wish to study definitions of R 
relative to some universe &@C N — N for which those conditions are not 
automatically satisfied (think e.g. of the case @= LS, the universe of 
lawless sequences, cf. 4.6.2). 

The Dedekind definition of the reals, i.e. R4, made its appearance already 
in Brouwer (1926), but as long as countable choice is assumed, or built into 
the definition by a strong definition of locatedness ((d’) in 5.9) there is not 
much to choose between the two, since then R and R®@ are isomorphic 
(5.10). 

The apartness relation in the real plane was introduced by Brouwer 
(1919, “drtlich verschieden”); the basic properties AP1-3 were formulated 
by Heyting (1925). Brouwer also observed the stability of = on R? (1923). 


8.2. Order completeness. The failure of the theorem that every monotone 
sequence of reals has a real limit, and hence the failure of the 
Bolzano—Weierstrass theorem, stimulated the search for extensions of R or 
R‘4 with better properties of order completeness. A survey of these attempts 
may be found in Troelstra (1980, 1982). R°* turned out to be the canonical 
answer to the question: what is the order completion of @? If one requires 
order completeness with respect to weakly bounded subsets of Q, the 
answer is R°. 

R°° was investigated by Staples (1971); Troelstra (1980) contains a rather 
detailed exposition of the basic properties of R°* and R°, which formed the 
basis for the presentation in the present chapter. Lorenzen (1955) uses 
Dedekind cuts in a predicative framework where the use of classical logic is 
justified via the negative translation. Thus the reals considered there are 
close to R® from our point of view. Our definition of R° differs from the 
one used by Staples in two respects: we only work with left cuts, instead of 
left and right cuts (cf. E5.5.3) and instead of a defined equivalence relation 
on cuts we simply use set-theoretic equality; for this reason we have 
required strong monotonicity ((b) in 5.1) instead of monotonicity only. 

Certainly, R is far less important than R in constructive mathematics. 
Nevertheless R°* has its uses, for example in studying duals of Banach 
spaces and reflexivity of such spaces in a manner which follows the classical 
theory more closely than in the treatment of e.g. Bishop and Bridges (1985, 
section 7.6); see for example Troelstra (1980, section 6). 
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8.3. Specker sequences. Specker (1949) gave the first proof of theorem 4.5; 
the particularly simple proof presented here is due to Rice (1954). The 
existence of strong Specker sequences was shown as part of the proof of 
theorem 4.3 of Zaslavskij (1962). More recently Rice’s construction has 
been investigated in Kushner (1981, 1983). 


8.4. Reals in CRM. It is an oversimplification to say that the various 
notions of real number in the literature on CRM are obtained by taking 
some classical definition and interpreting the logical operations according 
to numerical realizability (section 4.4). 

Indeed, the notion of ““FR-number” or “real duplex” corresponds to our 
notion of Cauchy real interpreted by realizability, and the pseudo-numbers 
defined by sequences of rationals (r,),, satisfying 


Wk nV (Ir 4m — Tham < 27%), (1) 
modulo an equivalence relation = given by 
(adn = (Sn)n = Wk (ym ~ Snel < 2-*) (2) 


correspond to the non-oscillating sequences of the intuitionistic literature 
(e.g. Dijkman 1952) modulo =. Realizability is in the latter case irrelevant 
since (1) and (2) are almost negative, i.e. truth equals realizability in this 
case (cf. 4.4.5). 

However, in the literature on CRM one also studies “F-numbers” given 
by an algorithm for a fundamental sequence (7,),,, but the algorithm for the 
modulus of convergence is not given, that is to say operations on F-num- 
bers are algorithms acting on the algorithm for the sequence only, not on 
the algorithm for its modulus (see e.g. Shanin 1962). Thus considering 
F-numbers in CRM is the same as considering FR-numbers while restrict- 
ing attention to those algorithmic operations depending on the code of the 
sequence itself only. 

It hardly needs saying that such a notion is rather awkward from the 
logical point of view. 


Exercises 


§.2.1. Prove AP3 in proposition 2.9. 


5.2.2. Prove 2.11(vii)—(ix). 
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§.2.3. Check that the operations in 2.13 are well-defined; prove 2.15(i), (iii). 


5.2.4. Give a weak counterexample to the assertion “each real number is either rational or not 
rational”. 


5.2.5. Give an example of a real x such that x is not known to be rational, though 4(x is 
rational). 


§.2.6. Prove that 
f+ 0) 
e= >) ifn! 
n=O 


has a positive measure of irrationality, ie. there is a g € N? > N such that Je — p- qo} > 
(Pp, 9)" for all p, g & N (Heyting 1956, 2.3). 


$.3.1. Show that /* in 3.4 is well-defined. 
5.3.2. Give a weak counterexample to Vxy(xy = 0 7 x =O Vy =0). 
§.3.3. Prove 3.7(i), (iv). 


5.3.4. Show that for all x,y,z ER: x<yoxtz<ytz,x<yAz>O0>7x-72<y-z, 
x<yAz<O07KxK-2>y-z, 


5.3.5. Show that we cannot expect to find a total extension of Xx.x7! (3.11). 


5.3.6. Show that x # y = p(x, y) > O defines an apartness relation on a metric space (V, p) 
(i.e. # satisfies AP1-3 as in 2.9). 


5.3.7. Show that, under the usual definition of the metric for separable Hilbert space 7,0? is 
also constructively a metric space. Define intuitionistically a suitable metric on Baire space 
yielding the usual topology (cf. 4.1.5). 


5.3.8. Let (V,, p,) be a countable sequence of metric spaces; define an appropriate metric on 
T1{V,,: n © N} (cf. 4.1.5). 


5.3.9. Let @ be the collection of fundamental sequences and let x9, x, € #, A = {Xxq, x}, 
B= {(x9) = »(X;) ~}. Then Vy © BAx € A(x € y). Show by means of a weak counterexam- 
ple that we cannot expect to prove 3f € B> AVy € A(f(y) € y) for suitably chosen x, x;. 
In fact, Vy € B3x © A(x © y) > Sf © A> BVy © B( f(y) € y) for arbitrary two-element 
sets A, B as above implies 3-PEM. 


§.3.10. Show, by means of a weak counterexample, that the set of reals representable by 
infinite decimal fractions is not closed under addition. 
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5.4.1. Show that 3-PEM suffices to prove “every bounded monotone sequence of reals has a 
limit”. 


5.4.2. We call a set infinite if it contains a subset in 1-1 correspondence with N. Prove 
constructively: let A be any bounded infinite set of reals, and let m,n © N; then there is an 
interval of length 2~” containing at least m elements of A (Brouwer 1952; Heyting 1956, 
3.4.6). 


5.4.3. We call a sequence of reals (x,,),, non-oscillating iff Vk anVm(|x, — Xpaml| < 27*). 
Show that a bounded monotone sequence is non-oscillating. 


5.4.4. A sequence of reals (x,,), is said to be negatively convergent iff —W4xVkInVm 
(1x — Xn4m| < 27*) (Dijkman 1952). Give an example of a non-oscillating sequence which 
cannot be shown to be negatively convergent (see e.g. Heyting 1956, 1.3.2, example (ii)). What 
can we say on the basis of CTy? 


5.4.5. Is “(x,), is negatively convergent” a weaker statement than Jx—4VkAnVm(|x — 
Xn ml < 2%) 


§.5.1. Prove 5.5 and 5.6. 
5.5.2. Complete the proofs of 5.7, 5.8 and 5.10. 


5.5.3. An equivalent definition of R4 is as follows. A Dedekind cut is a pair (S,T) with 
S ¢ Q, Tc Q such that 


(1) Vr € SVr’ € T(r <r’) (disjointness), 
Q) 3rr(rESaAr ET) (boundedness), 
(3) Vr 6 S3r’ © S(r <7’), Vr € Tar’ € T(r’ <r) (openness), 

(4) Wr(r<r' > reESvreET) (locatedness). 


Define now < and < and show how this notion of Dedekind cuts yields a system 
order-isomorphic to the system of our original definition. Can you adapt this symmetric 
definition to R°®, R°? 


5.5.4. Assuming Kripke’s schema KS (4.9.3), show that each element of R° is the supremum 
of a bounded monotone increasing sequence of rationals. 


5.5.5. Assuming CTp, show that not every element of R°* is the weak limit of a non-oscillating 
sequence. Here X € R°* is said to be the weak limit of a sequence (x,,),, iff 


Vk In m(|X — x,| < 2-*) 


(cf. section 2 of Troelstra 1982). 
5.5.6. Complete the proof of 5.12 by proving the second lemma 5.14 and (ii), (iii) of 5.12. 
5.6.1. Establish the remark after 6.4: Vik(M,(i, k) > M,-(i, k)). 


5.6.2. Give proofs of 6.5 in full and prove 6.8. 
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5.6.3. Show that we cannot expect to find an f’ € R > R™ extending Ax.x~'. Hint, Show 
that such an f’ cannot exist if we assume WC-N (Troelstra 1980, 5.6). 


5.6.4, Show that R© and R® do not carry an apartness relation, i.e. there is no binary relation 
that satisfies AP1-3 of 2.9. Hint. Cf. 5.11. 


5.7.1. Prove the statement at the end of 7.2, and show that Y cannot be a singleton real in %. 


5.7.2. Define R** to be the collection of “near-singletons” among the subsets of R, i.e., 
HER* = VS,TEX(S=T) A—3S(S EF). 

For %, 2’ & R** put 
R<X' =WSEIVTELR(S<T). 


Show that R‘* is order-isomorphic to RS by an isomorphism mapping {r%} to r* for all 
r © Q (Troelstra 1982, 4.2). 


5.7.3. Show that every X € R‘, respectively any X © R°* can be obtained as the limit of a 
Cauchy sequence of elements in RS, respectively RS (van der Hoeven in Troelstra 1982, 4.3). 


5.7.4, Prove the proposition in 7.4, 


CHAPTER 6 


SOME ELEMENTARY ANALYSIS 


In this chapter we present some examples of constructive analysis, so that 
the reader can see some constructive mathematics “at work”; no systematic 
treatment is attempted. The first two sections contain some basics, such as 
the intermediate value theorem, differentiability, Riemann integration and 
Taylor series. These sections presuppose sections 5.1-5.4, 

In section 3 we derive some consequences of the weak continuity princi- 
ple WC-N and the fan theorem; this material relies on the sections 4.6—-4.7. 

Finally, section 4 contains some examples of analysis in constructive 
recursive mathematics; specifically, we give a number of “pathological” 
results based on the non-compactness of [0, 1] under the assumption of CT, 
(such as the existence of a continuous, but not uniformly continuous 
real-valued function on [0,1]) and a positive result requiring ECT, + MP, 
namely that all functions f€R — R are continuous. This section pre- 
supposes sections 4.3 and 4.5, and for the final result, based on ECT, + MP, 
also section 4.4 is needed. 

Our notions of “continuous function” and “uniformly continuous func- 
tion” are the usual ones; cf. 5.3.13, 5.3.15, 5.3.2. As long as we assume ACoo 
we do not have to introduce the modulus of continuity explicitly. The basic 
properties of continuous functions from traditional mathematics, such as 
closure under addition and multiplication, are also constructively correct 
and we will apply them below without proving them. The reader can easily 
supply proofs, usually by (almost) copying the classical proof. 


1. Intermediate-value and supremum theorems 


1.1. DEFINITION. Let a,b © R, a < b. The open interval (a, b) and the 
closed interval (or segment) (a, b] are defined as 


(a,b) = {x:a<x<b}, 
[a,b] = {x:a<x<b}. 
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We can define half-open intervals (a, b],[a, b) in the same way. Extending 
R to R* by addition of — 00, +00 with Vx © R(-— 00 < x < 0), we can 
define (— 00, b),(— 0, b], (b, 00), [b, 00); R = (— 0,00) as above. O 


REMARK. The definitions of (a, b), [a, b] can be extended to arbitrary pairs 
a, b, i.e. without the condition a < b: if c = min(a, b), d = max(a, b), we 
put 

(a,b) =(c,d), [a,6] = [c,d], 
where (c, d) and [c, d] are defined as before. 


We shall now study the constructive validity of the intermediate-value 
theorem. We first present a 


1.2. Counterexample to the intermediate-value theorem. The intermediate 
value theorem cannot be proved in the strong form, familiar from classical 
analysis, with 3x € [a, b]( f(x) = c) as its conclusion. A weak counterex- 
ample is easily constructed: let a be a real number for which a < 0 or 
a > 0 is unknown, and define a uniformly continuous f on [0, 3] by 


f(x) = min(x — 1,0) + max(0, x — 2) 
(see fig. 6.1) and let 
f(x) = f(x) +4. 


(0, 1) 


(1, 0) (2, 0) 


(0, 1) 


Fig. 6.1 
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Then, obviously, if f*(x) = 0 for some x € [0, 3], we can decide whether 
x <2 or x > 1; but in the first case a > 0, and in the second case a < 0. 
Therefore we are unable to determine x with any desired degree of 
accuracy. It is routine to transform this weak counterexample into a proper 
refutation of the universally quantified statement on the assumption of 
—SEP (cf. 4.3.6, 5.2.12). 

Note that in terms of classical mathematics the counterexample shows 
that intermediate values cannot be found continuously from the data, i.e. 
the function itself as element of a metrizable function space. Here the 
metric is the familiar supremum metric: 


e(f,g) = sup{|f(x) — g(x)|: x € [a, b]}; 


as we shall see below (1.8), this metric is constructively well-defined. O 


1.3. PROPOSITION (connectedness of the interval). Leta < b, and let A, B be 
inhabited open subsets of [a,b] covering [a, b], i.e. [a,b] = A U B; then 
there isareald& ANB, 


PRroor. We construct Cauchy sequences (a,),, C A, (b,), C B, lim(a,), 
= lim(d,),, = ¢ as follows. A and B are inhabited, so let ay € A, by € B. 
Put co = 4(aq + bo); then cg EA or CoE B. If co A, we can take 
a, = Co, 5b, = bo, and if co © B, we may take a, = ag, b, = co. Then 
|a, — b,| = 4]ag — bo] and a, € A, b, © B. Repeating this we find se- 
quences (a,), C A, (b,), © B with {a, — b,| < 27"laq — bol. Obviously 
(a,,), and (6,),, converge, so let c = lim(a,,), = lim(b,),, ¢ € A or c € B; 
say c © A. A is open, and therefore U(2~*, c) C A for some k, hence for n 
sufficiently large, b, € U(2~*,c) C A, so b, € ANB and we may take 
d:=b, O 


1.4. THEOREM (intermediate-value theorem). Let f € [a, b] > R be continu- 
ous, a <b, f(a) <c < f(b), then 


Wkax & [a,b] (f(x) — ¢] < 27*). 


Proor. A = {x: f(x)<c+27*} and B= {x: f(x)>c—27*} are 
open because of the continuity of f, and a € {x: f(x) <c+27*}, bE 
{x3 f(x) > ¢-— 27*}. (A, B} covers [a, b], since c-— 27* <c + 27*, and 
so f(x) < c+ 27-* orc — 27* < f(x). By the preceding lemma we can find 
an xin the intersection, which therefore satisfies ¢c — 2~* < f(x) < 
orl. -B 
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The following version of the intermediate value theorem retains the 
strong conclusion of the classical version, but strengthens the hypothesis: 


1.5. THEOREM. Let f be continuous on [a, b], a < b, f(a) < 0 < f(b), and 
such that Wy, z € [a, b\(y < z > Ax © [y, z]( f(x) # 0)). Then 
dx € [a, b]( f(x) = 9). 


Proor. The proof is a slight variation on the well-known bisection argu- 
ment: x is obtained as the common limit of (x,,),,(¥,), with Vn(x, < 
Xnt1 <Ina1 SIn)> A(X) < 9, f(y,) > 0 for all n. For x, yo we take a 
and b respectively. Suppose we have already constructed x, y,. Find a z 
in a segment J with midpoint }(x, + y,) and length 4(y, — x,,) such that 
f(z) #0; then f(z) < 0 or f(z) > 0. If f(z) < 0, take x4, = 2, 41 = 
yy if f(z) > 0, let x4) °= Xn. Yeti = Zz. In both cases |[x,4) — i 4al 
< 3x, — y,|. Hence (x,,), and ¢y,), have the same limit. Because of the 
continuity of f, f(lim(x,),) < 0, f(lim(y,,),) = 0, so f(x) =0. O 


REMARK. The condition of the theorem is satisfied, for example, by all 
polynomials (see 2.7). 

Next we shall show that a uniformly continuous function on a segment of 
finite length always does have a supremum, though the supremum is not 
necessarily assumed. 


1.6. DEFINITION. We recall that x is a supremum (sup) or least upper bound 
(l.u.b.) of a set X C R iff 


Vy E X(y sx) A Wkay © X(y>x-27*), 


Similarly for an infimum (inf) or greatest lower bound (g.1.b), we require 
Vkay © X(x +2°-* > y). The supremum (infimum) of a function f on 
[a, b] is the sup (inf) of { fx: x € [a, b]} (notation: sup(f), inf( /)). 

X CR is totally bounded if for each k © N there are xo,...,X,_) © X 
such that Vx © X3i < n(|x — x,)<27*). DO 


1.7. Lemma. A totally bounded X C R has an infimum and a supremum. 


Proor. For each k let { xX, 9,---+ Xk, nck) } be such that Vx © XJi < n(k) 
(|x — x, | < 27*). Let x, = max{x, ;:i < n(k)}. Now it is easy to see 
that Vx © XWk(x < x, +27*). 

From Wx € XWk(x < x, + 2~*) we conclude x,4,; <x, + 27* and x, 
= Xyyq 2"), So: |x, = Xpaih = 27... Hence we get. |xp—2xp4,(< 
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2-4 Q-K 14 Q-k-m 1 < Q-#t1 (xy) is therefore a Cauchy sequence 
with limit x. We leave it as an exercise to show that x = sup(X). O 


1.8. THEOREM. Let f € [a,b] - R be uniformly continuous on a < b, then 
sup( f) and inf( f) exist. 


PRoor. Let a be a modulus for f such that Vx, y € [a, b] Vk(|x — y| < 
2- ok 5 | f(x) — f(y)| < 27*). { fe: x € [a, b]} is totally bounded: let n 
be such that (b — a)n7! < 27°, and let x,:= a+ i(b—-—a)n7' fori<n, 
Then by uniform continuity Vx © [a, b]di < n(|f(x) — f(x,)| < 27"). 
Now apply the preceding lemma. O 


1.9, CoroLLaRy. For f [a,b] uniformly continuous, a <b, we have 
Vk ax € [a, b\(sup(f) — f(x) < 2>*) and WkAx € [a, b](f(x) — inf(f) 
ee Saat) a | 


1.10. Counterexample to the existence of a maximum. It is again easy to 
show that we cannot strengthen the conclusion of the preceding theorem to 
3x € [a, b\( f(x) = sup(/)). Let a be any real such that a < 0 or a = Ois 
unknown, and define f on [0, 1] by 


f(x) == ax. 

It is easy to see that sup(f) = max(0, a): if a < 0, then sup(/) = 0; if 
a=0ora> 0, sup(/) = a, and since, by 5.2.11(0x), “(a =O Va < OV 
a > 0), also —sup(f) = max(0, a). Hence, by the stability of equality for 
reals (5.2.10), sup(/) = max(0, a). Now assume that for some x & [0,1] 
f(x) = sup(f); then x > 0 V x < 1, which implies a > 0 or a < 0. 

The transformation into a refutation, on the basis of —SEP (cf. 4.3.6, 
5.2.12), is now routine. There is also a classical counterpart: we cannot 
determine the real x for which f(x) = sup(f) continuously in f. 


2. Differentiation and integration 
2.1. DEFINITION. f © R > R is differentiable at xq (with modulus a) if for 
some aE R 

WkWx(|xo — x| < 27% > a(x — x9) — (fe — fxo)| < 27*|x — xol)- 


The uniquely determined a is called the derivative of f at xo. More 
important is the following notion. 
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Let a<b; f€[a,b]—R is uniformly differentiable on (a, b], with 
derivative f', iff for some a 


Wkvx,y € [a, b](|x — y|< 27% > 
If'(x)(y — x) — (fr — fx) < 2-*Ly — x). 


For iterated derivatives we shall use the familiar notation: f© = f, f = 


fi for d= (FY, O 


REMARK. The notion “f is differentiable on [a, b]’, defined by Vx € [a, 5] 
(f is differentiable at x) does not lead to a smooth theory; we need the 
(classically equivalent) notion of uniform differentiability. 

We met with a similar phenomenon in the theory of continuous f € 
R — R, where the requirement of uniform continuity on all bounded closed 
intervals leads to a smoother theory. 


2.2. PROPOSITION. A uniformly differentiable f has a uniformly continuous 
derivative f', and is itself uniformly continuous. 


ProorF. Exercise. O 


2.3. Proposition. Let f, g be uniformly differentiable on bounded intervals 
with derivatives f’ and g’ respectively. Then 


(f+ 8)(x) =f'(x) + 8'(x), 
(f+ g)(x) =f'(x) - g(x) + f(x) - 8’(x), 

and if f has a positive l.u.b. on the interval I, then on I 
(PY) = -f'(@)/fY 

Proor. Exercise. O 


The elementary theory of differentiability is rather similar to the classical 
theory. We shall first establish a version of Rolle’s theorem. Though weaker 
than the classical version, it suffices for important applications such as 
Taylor’s series in 2.5 below. 


2.4. LEMMA (Rolle’s theorem). Let f be uniformly differentiable on [a, b], 
a<b, f(a) = f(b). Then inf(|f’) = 0, i.e. Vnax € [a, dK f'(x)| < 2~"). 
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Proor. Let m = inf{|f’(x)|: x € [a, b]}, this exists by 2.2 and 1.8. Sup- 
pose m > 0. Then in particular f’(a) < —m or f(a) = m; let us assume 
f(a) 2 m. Then for all x € [a,b] f’(x) = m. For if for some x in [a, b] 
f(x) < m, then f(x) < m — 27* for some k. Now apply the intermediate 
value theorem to f’ and [a,x]: there is a y such that |[f(y) —- 
(m — 27*)| < 27*"} ie. [f’(y)| < m, contradicting the definition of m. 

Let & be such that 

jx -— vy) <2" > 1f'(x)(y — x) — (fy - fe) s 3m, 

and choose p € N such that (b — a)p~! < 2~*, and let 
x,=ati(b—a)p' fori< p. Then0 = fb —- fa= 


n-1 

LY (fena1 — fx,) 

k=0 
n-1 
y fo, Gms pa x,) a 
k=0 


D (fe) ~ f(x) — fe) tear — 4) 2 


n=1 n-1 
Le m(Xe — Xe) — Le dm (xXp41 — X~) = (db — a) > 0. 
a =0 
This is contradictory, hence m = 0 and therefore by 1.9 the lemma follows. 


Oo 


2.5. THEOREM (Taylor series). Let f be n + 1 times uniformly differentiable 
on a closed interval [a, b] and let the remainder function R be defined by 


fat x)= ¥ (s(a)/ki)x* + R(a + x). 
k=0 
For all k there is ac € [a, b] such that 
(A) |R(b) — (b= e)"(b - a) f*M(c) ny s 2-*. 
PRooF, (i) If |b — a] = 0, we may take c= b= a. 
(ii) Let now |b — a| > 0. We put 
a(x) = f(b) — [f(x) + (F(x) /)(b — x) + + 
+(f(x)/n!)(b — x)")| — R(b)(b - x)/(b - a). 
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g is continuous and uniformly differentiable (2.2, 2.3), g(a) = g(b) = 0. 
g’(x) = {-£O(x) + (f(x) 0!) — (f(x) /1)(b - x) 
+ (f(x) /1t)(b = x) — (f(x) 721)(6 = x) 
+ (fO(x)/21)(b = x)? — (f(x) /31)(b = x) to 
+ (f(x) /(n = 1)1)(b = x)" 
— (f(x) /n!)(b — x)"} + R(b)/(b - a) 
= —(f0°?(x)/n!)(b — x)" + R(b)/(b — a). 
With Rolle’ theorem we find therefore a c € [a, b] such that 
Ie’(o) =|—( £7 (ce) /n!)(b — c)" + R(b)/(b — a) < 
2-*/(b- a). 


The preceding argument shows that 
jJb-al=Ov|[b-al#0- 


sup IRD) — (f"*%(e)/nt)(b - c)"(b - a)|) < 27" 


cel 


hence, since ——(|b — a| = 0 V |b — a| # 0), the assertion of the theorem 
follows. O 


2.6. LEMMA. Let f be n +1 times uniformly differentiable on [a, b], such 
that 


inf {  1r%c] >0. 


Then in each interval [c,d] C [a,b] with c > d there is an x such that 


f(x) # 0. 


PROOF. 


i=0 
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If c + x € [c, d], we have by Taylor’s development 
fle+ x) =f(e) + (fe)/1)x + 
(f(e)/2!)x? + --- +(f(c)/n!)x" + R(c + x), 
where 
[R(e + x) — (f*P(e)/(n + Y'I)(e + x — e)"x| <2-™ 


for suitable e € [c,c + x]. The functions f, f®,..., f“"* are uniformly 
continuous on [a, b], hence there is a D © R such that 


| f(x)| < Dforx €[a,b], i<n+l. 
On the other hand there is an i < n, such that 
|f(c)| > C/(n + 2) 


so f(c) # 0. If i = 0, we are done. So assume i > 0. Let c + x € [c, d] 
and put 


Q = fle) + fO(e)x/Ul + +++ +fO(e)x//il, 
QO’ = fED(c)x 1G + Itt --- +£(c)x"/n! + R(c + x), 
then 
fle+x)=Q+Q'. 
Assume | f(c)/i!{ > 2~* and choose x = 2~?; then 
[£9 (a) fil ears 
Also 
Q' <|x|-n-D- 2Fi=n-D-2-PGt), 
since we can approximate R(c + x) arbitrarily close by a term of the form 
(f%*Y(e)/ni)(c + x -—e)"x 
with e € [c, c + x]. By choosing p sufficiently large, we can achieve 
D -2-PUtD < 2-pi-k-2. 


Also {Q|>2-*-?'1 or |Q|<27*-?. If |Q}> 2°" 2-1 f(et+ x)= 
[]Q| — JO} > 2-* Pir} — 27k Pi? = Dk vi? TQ) < 27?! then by 


300 Some elementary analysis (Ch. 6 


\f (c)x!/il] > 27*-?!, there is jf <i such that |f(c)x//j!| #0; now 
repeat the argument with j instead of i. Thus in a finite number of steps we 
obtain |f(c + x)| #0 for suitable x. O 


2.7. THEOREM. Let f be a polynomial in x with {(0) < 0, {(1) > 0; then there 
is an x © [0,1] with-f(x) = 0. 


Proor. Let f(x) = a,x" +a,_,x""| +--+ +a . Since f(0) < 0 and f(1) 
> 0 we have a, <0 and a,+a,_,+--- +a )> 0, hence there is an 
i> 0 such that a; #0. If a, #0, f"(x) =a, +n! #0 and the hypotheses 
of 2.6 are fulfilled. If a; #0, i < n, then 


fO(x) = (al /(n - i)Ia, x" 
+((n-1)!/(n-—1-i)!)a,_ jx" 1 +--+ +a,i!, 
and 
(inf| f| > 2-2|a,)- i!) v (inf|f| < 274a,] + i!). 


In the first case we are finished; in the second case we find a j >i with 
a, #0 and can repeat the argument. Ultimately we find that f satisfies the 
hypothesis of 2.6, and so with 1.5 we find x with f(x) =0. O 


COROLLARY. R is real-closed field (cf. chapter 8). O 


Riemann integration (2.8-2.14). We shall show that for any closed interval 
[a,b] with a <b and every uniformly continuous f € [a,b] > R, the 
Riemann integral /?f exists. The treatment is rather similar to treatments in 
classical mathematics, so we give an outline only, leaving a lot of details to 
the reader as exercises. For the time being, [a, b] will be fixed. 


2.8. DEFINITION. The sequence P = (ap,...,a,) with ag=a, a, =b, 
Q9 Sa, < ++: <a, is called a partition of [a, b}; if a, = a+ 7b — a), 
r, © Q for i < n, we have a rational partition. n is called the length of the 
partition (d@o,...,a@,), and 


mesh( P) = sup{|a;,, — aJ:i<n-1}. 


A partition Q is a refinement of P if P is a subsequence of Q. O 
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NoraTIon. Let f be any uniformly continuous f on [a, 5]; we use S(f, P) 
to denote any sum of the form 

a-l 

DX f(x ais: — @;) with x; € [a;, a;,,). 

i=0 


If P is the rational partition (ay,..., a,,) with a; = a + i(b — a)n™, then 
n-1 
S(f, a,b, n) = (b ap an y fa) 
i=0 
is an S(f, P). 
2.9. THEOREM. Let f be uniformly continuous on (a, b] with modulus a. Then 


(S(f, a, b, n)), converges to a limit {°f. Moreover, for any rational partition 
P of [a,b], with mesh(P) < 2~** we have for all S(f, P) 


sty, P)- [| <2-*(b-a). 


PROOF. The proof heavily relies on the fact that all partitions considered 
are rational. Let P = (do,...,a,) be a rational partition of [a, b] with 
mesh(P) < 2~**, and let Q = (bo,..-., b,,) be a rational refinement of P. 
Let us write 2; for summation over the j with a, < b; < a;,,. Then 


IS(7, P) — S(F,Q)| 


a-l m-1 
2X fx Nain = a;) an 2 fyb = bj) 


a D Msd3 (be a 5) = pH Sif (yb iz 5) 


= BLf(s) — Hop) Il~ 8) 


n-1 
s L327"), — 6) =2-*(b - a). 
i=0 
Now let R be any other rational partition of [a, b] with mesh (R) < 2~%, 
and let Q be a common rational refinement of P and R; then 
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ISCf, P) — S(f, R)| < 27**(b — a). In particular 
|S(f, a, b, m) — S(f, a, b, m’)| < 27-**(b - a) 


for all m, m’ > (b — a)2**, hence (S(f, a, b, n)), is a Cauchy sequence, 
etcetera. O 


2.10. Lemma. Let P = (do,...,a,) be any partition with mesh(P) < e, 


then for any § > 0 there is a rational partition P’ of [ay, a,] with mesh(P’) 
<e, P’ = (a,..., a),), such that for all 


n-1 n—1 
S(f, P) = X fx Mains — 4), S(f, P= Y f(x )ainr - a7) 


i=0 
we have 
IS(f, P’) — S(f, P)| <6. 
PROOF. Exercise. O 
As a corollary we have 
2.11. THEOREM. For any S(f, P) with mesh(P) < 2~* we have 
b 
Iscs,P)— f'r 


The following propositions and theorem are also easily proved; their proofs 
are left as exercises. 


<2-*(b-a). O 


2.12. PROPOSITION. Let a <b <c, and let f, g be uniformly continuous on 
[a, c]. Then 

Gi) f(x: fty-g)=x- (ff) ty (fag); 

(iil) Vx € [a, cK f(x) < g(x)) > Jaf s Jog; 

dii) {of = Jef + {sf 0 


2.13. PROPOSITION. Let p be a non-negative polygonal function on [a, 5, 
then {2p is the area of {(x, y): x € [a,b], y € [0, p(x)]}. O 


2.14. THEOREM. Let f be uniformly continuous on [a, b]. Then g defined by 
g(x) = ff is uniformly differentiable on (a, b] with g(x) = f(x). O 
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3. Consequences of WC-N and FAN 


3.1. A spread-representation of {0,1}. We start with the construction of a 
spread-representation of the interval [0,1]. Every x € [0,1] can be ap- 
proximated by a fundamental sequence (a(n) -2~"~1), such that 


Wn(a(n) < 2"*1) A Wn((2a(n) — a(n + 1)| < 1). (1) 


To see this, choose a such that |x ~ a(n)-27"~1| < (5/8)-2-""}, a(n) < 
2"*1. Then a(n): 27"! — a(n + 1)-27"-7| < |x — a(n): 27" + 
|x — a(n + 1)-2>-"-4| < 5/8427"! + 27"), hence j2a(n) — a(n + 1)| 
< 15-273 < 2. Let T be the spread such that a € T iff (1) holds, and put 
fora € T 


x, = lim(a(n)-27-"71),,. x% = a(n) +2777}, 
View {x,:aEnAae€T} forne T. 


T, provided with the tree topology of 4.1.5 is (homeomorphic to) the Cantor 
discontinuum, and this representation of [0, 1] shows in fact that [0, 1] is the 
image of the Cantor discontinuum under a continuous mapping 9; note 
that V’ = g[V,,] (notation of 4.1.5). 

In fact, we can show a little bit more: [0,1] is a quotient of T (cf. 
Engelking 1968, p. 83). More precisely, we have the following 


3.2. PROPOSITION. 
(i) Va(x, € [0,1]; 
(ii) WnVa € T(|x, — x2] < 27~"7}), hence 
Wn(Vignaiy © U(2-"4, x,)) for a © T; 
(iii) Wx © [0, 1]da © T(x = x, A Wk(U(2-*-4, x) C Vi,)). 


PROOF. (i) is obvious; for (4i) observe that 


Xo = xal < >: core _ ela | < > Qo-n-k-2 < Qoa-l 
k=0 k=0 

(iii) Suppose |x — y| < 2~*~5, x, y € [0,1] and determine a, 8 € T such 
that for all “a |x — a(n)-27-""}) < (5/8)2-""}, |y — B(n)-2-"" 4] < 
(5/8)2-"7}, then 2-*“a(k) os 2-*-2B(k +1) <|x- 2-*-lo(k)| + 
ly — x| + |y — 27-*-*B(k +1] < (5/8)-27*-} + (5/8)-2-*-27 + 2-*-S 
= 2-1 50 Ra(k) - B(k + DI <1. 
Therefore y = a(k + 1)*An.B(k +14 2) © T,and x,=y © Vigsy. O 
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N.B. Property (iii) is the quotient-property. 


It is easy to draw a picture of T and the associated sequences 
(a(n)2~"~1),. The infinite branches in fig. 6.2 correspond to the associated 
sequences. The branches passing through k-2~” represent the interval 
[(K — 1)27-",(k + 1)27"] / (0, 1). 


3.3. PROPOSINON (WC-N). Let {X,:n € 1} with ICN be a cover of R, 
then {Int(X,):n GI} is again a cover (and similarly for any segment 
instead of R). 


ProoF. Without loss of generality let J = N (if necessary we can replace 
{X,:1 © I} by {X,:n © N} with X) = {x: x © X, An € 1}). It suffices 
to show that for any cover (X,), of [—m, m], (Int(X,)), also covers 
{—m, m], and for this in turn it is enough to consider covers ( X,,),, of (0, 1]. 
Therefore, let [0,1] Cc U{ X,: n © N}, ie. 


Va € Tan(x, € X,). 
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Let a7, 8; range over T (notation as in 4.1.2), then with WC-N 
Va;dImnVB, € am( xp e X,), 
so 
Vardimnvx € Vi,(x € X,). 


Now choose, by 3.2(iii), for any x €[0,1] an @ such that x =x,, 
Wn(U(2-"~4, x,) C Ve; then Vxdadmn(U(2~"~4, x.) CX, A xy = x). 


n 


Hence each x belongs to the interior of some X,. O 


3.4. THEOREM (WC-N). All mappings f © X > R, X a segment, are con- 
tinuous. 


Proor. Fix k EN, let (7,),, be a fixed enumeration of Q, and let 
X= (x: [f(x) — ml < 274). 
Then (X,,),, covers X, hence (Int(X,,)),, covers X, ie. 
wx © Xamvy € U(2-", x) A X(I f(y») -— | < 27), 
hence 


Wx © XamVy © X(|x — y| < 27" > [ f(x) -f(y)| <27**1). Oo 


The next few applications require FAN. 


3.5. THEOREM (FAN; compactness of the interval). Let (a, b] be a segment 
with a < b, and {W,:i © 1} an open cover of (a, b]; then there is a finitely 
indexed subcover W,, wo+y W;. 
PRooF. It suffices to prove the theorem for [0,1]. As noted in 4.7.3, FAN 
expresses compactness of T, and since we just constructed in 3.1-2 a 
continuous mapping g» from T onto [0,1] and compactness is clearly 
preserved under continuous mappings, [0, 1] is compact. 

The argument may be spelt out more formally as follows. We obviously 
have 


Va,;dm3i € I(U(2-", x.) € W,). 
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If U(2~™, x,) © W,, then because of x, © Vz, C U(2~™, x,) we find 
Va,amai € I(Vi,, c W,). 
By FAN we find a & such that 
Va7di € 1(Vi, c W,). 
There are finitely many possible values for ak; selecting a W, for each ak 


we find {W,,,..., W,,} covering [0,1]. O 


3.6. THEOREM (FAN). Each continuous f € [a,b] > R is uniformly con- 
tinuous. 
Proor. By the continuity of f, for fixed n = N 

Va,am3i( f[U(2-”, x,)] © U(2-", )), 


where (r,), is a standard enumeration of Q as before. Hence, since 
x, € Vz, © U(27", x4); 


Va,imii( f[V;,,] ¢ U(2~", 7). 
Thus we find a k such that 
Va73i( [VE] Cc ue, r;)). 


Suppose |x — y| < 2~*~4; choose x = x, such that Vm(U(2~"~4, x) C 
Vin), then in particular y © V,, f(x) =f(x,) € U2", 7r), fly) € 
U(2~", r,) for some i, and thus | f(x) — f(y)| < 2~"*1. Therefore 


[x —y]< 2-* 4 > f(x) — f(y) < 2-7". a 


3.7, THEOREM (FAN). Let f € [a,b] > R be continuous. Then 
(i) fhas a supremum, 
(ii) Wx © [a, bl f(x) > 0) > inf(f) > 0. 


PRroorF. (i) Immediate by 3.6 and 1.8. 

(ii) The cover (W,,),, with W, = {x: fx > 2~"} is an open cover since f 
is continuous; by 3.5 we find a subcover W,,...,W,. Take n= 
max{7,...,,}, then Vx € [a, b( f(x) > 27"). O : 


3.8. FAN* combines FAN with WC-N, i.e. FAN* is 
Va,dnA(a,n) > InVa;imVB € a(n) A(B, m). 
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Assuming FAN* we can strengthen 3.6, 3.7 by dropping the condition that 
f has to be continuous: 


THEOREM (FAN*), Let f € [a, b] — R, then 
(i) fis uniformly continuous and has a supremum. 


(ii) Wx € [a, b](f(x) > 0) > inf(f)>0. OB 


The first half of (i) is often called “Brouwer’s continuity theorem”. 


4, Analysis in CRM: consequences of CT), ECT), MP 


In the present section we shall demonstrate some of the striking divergences 
from traditional analysis in CRM, such as the existence of continuous, 
unbounded (hence not uniformly continuous) functions on [0, 1], uniformly 
continuous functions not assuming their supremum on {0, 1], etc. 

The principal tool in many of these constructions is the notion of a 
singular cover. 


4.1. DEFINITION. Let us denote the length of an open interval or segment / 
(which by definition contains its endpoints) by |/|. A sequence (J,),, is a 
cover of X C R iff Vx © Xdn(x € I,). (1) is an interval cover if each J, is 
an open interval with rational endpoints, of finite length, and a segmental 
cover if each I, is a segment of finite length with rational endpoints. 

A sequence (J,,),, of intervals or segments is called e-bounded iff 


vn ¥ 1, <¢]. 


n=0 


The cover of an interval (a, b), a segment [a, b] (resp. R) is singular iff it is 
e-bounded for some ¢ < (b — a) (resp. some €). 

An interval cover (J,,), is universal iff for each sequence (J,), of 
intervals of finite length with rational endpoints such that lim(|J,|),, = 0 we 
have Inm(J, = I,,). 

A cover is said to have finite overlap if each point belongs to finitely 
many elements of the cover only. O 


4.2. THEOREM (CT,). For each e > 0 a universal e-bounded interval cover of 
R exists (hence such a cover also exists for intervals and segments). 


308 Some elementary analysis [Ch. 6 


Proor. Let An.b(m, n) enumerate all rational intervals of length < 2~”, 
and let @ be the partial function given by (m) = {m}(m). Let y enu- 
merate the domain of » without repetition (such a y exists, why?). Put 


I, = (no + ym + 1, (ym)), 


where 27° < e. Then 


k k k 
_ Lal < > Q-(motymtl) < p™ 2 (mot m+) < 27%, 
m=0 


m=0 m=0 


For the second inequality, arrange the numbers {y0,..., yk} according to 
magnitude, say as my < m, < +++ m,; and observe p<m, for p <k, 
hence 2~("ot™» +) < 2-(otP*D) for p < k). 

Now we prove universality. Let € enumerate rational intervals, with 
lim(|én]),, = 0; suppose 5 to be such that |&(6n)| < 27", and let ¢’ = 
An.&(6n). Then Wndm(é'n = y(n, m)), hence for some 6’ Wn(§'n = 
wW(n, 5’n)), therefore 


E(ngt n+l) =v(nygtnt+1,8"(no+n4+1)) 


where 6” = An.min,,((n, m) = &’n). By CT) we have for some k {k} = 
Am.8'"(ny + m + 1). 8” is total, hence @ is defined for k, and k = ym for 
suitable m. Thus 


E(ng tym +1) = (ng t ym t 1,8"(ng + ym + 1)). 


o(k) = {k}(k) = 8(ny + ym + 1), hence (ng + ym + 1) = (ng + ym 
+ 1, y(ym)). Finally, we have to show that (J,),, covers. Take any x; we 
can find (J7),, with [Jj] < 27" such that Vn(x © J). By the universality 
dnm({, = 1,,). O 


4.3. REMARK. If (J/,), is a singular cover of [0,1], say, no finite part 
{1o,---, 1} covers [0,1]; in fact, we can easily find a rational r € [0,1] \ 
(I) U ++: UI,,)~. To see this, observe that a rational segment of length 
< k-n~} can contain at most k + 1 points i-n~' with i < n (k — 1 points 
in the interior and 0, 1 or 2 points at the endpoints of the interval); and if 
[Ip] + --- +[1,,| < k-n71, at most k + n+ 1 points i-n7' can be covered 
by (J) U --+ UI,,)~. For n sufficiently large we can therefore find an r of 
the form i-n~! € [0,1]\\() U +++ UI,,)~. Thus the singular covers strik- 
ingly demonstrate the non-compactness of the interval in CRM. A stronger 
result is established in 4.9. O 
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4.4, THEOREM. (CT)). 

(i) There exists an f on{0, 1] such that f is uniformly continuous, Vx( f(x) > 0), 
inf( f) = 0; i.e. f does not assume its infimum. Similarly we can construct 
a uniformly continuous f which does not assume its supremum. 

(ii) There exists a continuous function on [0,1] which is unbounded (i.e. 
Wn4x( f(x) > n)) and hence not uniformly continuous. 


Proor. (i) Let ¢J,,),, be a singular interval covering of [0,1], and suppose 
I, = (", S,) (see fig. 6.3). Let f, be the uniformly continuous function such 
that 


f,(x) = max{0,4(r, — s,) —[x - 4(r, + 5,)|}. 


Then 
x)= L243) 


is everywhere defined on [0, 1] (observe that | /,(x)| < 1 everywhere), and is 
in fact uniformly continuous. For if |x — y| < 6, then |f,(x) — f,(y)| < 6 
for all n, hence 


o ioe) 
FOI) ss LLC) HL) 22s 8-2 278 = 28. 
n=0 n=0 
(Why does the first inequality hold?) ¢J,,), covers, hence for each x in [0, 1] 
x € I, for some n, i.e. f(x) > 0, hence f(x) > 0. On the other hand, since 
no subset {J,..., 1} covers, we can pick x € [0,1])\ (J) U --- UT,), so 
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f(x) = 0 for i <n, and 


fa)= 2. 24) so": 
k=n+1 
Therefore inf( f) = 0. 
(ii) Take g(x) = f(x)~! where f is as constructed under (i). O 


We can also construct singular coverings with finite overlap, via the 
following 


4.5, LEMMA. Let (J,,),, be an interval-covering of X, where X =R or a 
rational segment of finite length. Then there is a sequence (J,),e7, 2 CN, 
Z an inhabited index set such thatn © ZA m<n-—mé Z, consisting of 
rational segments such that 

(i) Wn e ZAm(J, CTT) («J,)n refines an) 

(ii) Vasme Z(J,C J, 7 n=m 

(iii) Wnym © Z(n #m-— Int(J,) a Int(J,,) = 2); 

(iv) Wx € XAkIn,m © Z(U(2~*, x) NX C (J, UI,,)7); 

(v) If no finite part of (I,,), covers X, then Z =N; 

(vi) If (1,), is e-bounded, then so is (J,)n; 

(vii) If Z = N and lim(¢|J,]),, = 0, then lim(|J,|), = 0. 


Proof. Let (J), be as indicated. Suppose Y’ = {Jo,..., J, } to have been 
constructed such that (i)-(iii) hold with {0,...,&} for Z. 

Then Y= X\ Int((Jg U +++ UJ,)~) consists of finitely many disjoint 
segments, say Yp,..., Y,,. Now let J be any rational segment from ¢J,),, 
and put J’) =f’ U {Y,N 1: Y, 1 inhabited, i < m}; clearly, Y’(/) 
again satisfies (i)-(ili). We put 


Fo = {lo}, First =f [1 nel SHU YZ, sn ]N}. 


We can enumerate Y by successively enumerating JY, J; \Lp, 
22 \f1 ++. » Since we cannot exclude the possibility that, for some k, for all 
n>k $,.,\¥%, is empty, we can only state that the enumeration runs 
over an inhabited downwards monotone index set Z. We leave (v) as an 
exercise. 

(vi) and (vii) are also obvious from the construction. It remains to prove 
(iv). Consider any x € X; x EI, say. Let J, = {Jo,...,J,}; by the 
construction (UY,)"= (I) U--- UL)”. Enumerating Y, from left to. 
right gives a rearrangement Fi (0) Yo(ty> +--+» Jacky: Clearly, x © J, implies 
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x € Int((Je¢ U Jocie1y)) for some i < k (cf. fig. 6.4). 


(|, —______________, 


Fig. 6.4 


Jos Jus Ja, Fg E Fn 13 {Ips Jpg te Spa Ip43} =Ia\Fa-- Os 
Note that in this case the “overlap” of the J; is at most one point. 


REMARK. A sequence of rational segments (J,), with property (iv) is 
sometimes called a quasi-cover. For a rather similar lemma, see E6.3.6. 


4.6. PROPOSITION (CT,). For X =R, or a rational segment and for any 

meéeN: 

(i) there exists a 2~™-bounded quasi-cover (I,,),, of X with lim¢|J,|),, = 95 

(ii) there exists a 2~”-bounded interval cover (J,,),, with lim<|J,|), = 0 and 
finite overlap, in fact: 


Wx © X3kn nynWny(y € U(2-*, x) NJ, on © (m, 15,73}). 
Proor. (i) Apply the preceding lemma to any singular cover as given by 
2 


(ii) Construct a 2771 quasi-cover as under (i), and let (J,),, enumerate 
without repetitions the open intervals Int((J, U J,,)~) for those J, and J,, 
which have an endpoint in common. Then (J,),, is a 2~ "-bounded interval 
covering satisfying (ii) (exercise). O 


4.7, DEFINITION. (x,,),, iS Said to be a strong Specker sequence iff (x,,), 18 
monotone, bounded and satisfies 

VxamkWn > k(x,<x-27™Vx,>x+27") 
or equivalently VxdmkVn > k(x, €(x-—2°-",x+27")). O 


Of course, the limit of a strong Specker sequence exists as an element in 
Re, which lies apart from every real. 


4.8. PROPOSITION (CT,). Strong Specker sequences exist. 
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First proof. Let (1,,), be a quasi-order of (0, 1] as indicated in the preceding 
lemma. Consider Int((0,1)\ (Ip U 1, U +: Ul,)~); this will consist of 
finitely many open intervals. Let y(n) be the leftmost endpoint of these 
intervals. We will show that y is a strong Specker sequence. Obviously, 


Wn(0 < yn < y(n +1) <1). 
Now let J, = [r,, 5,], and suppose 
Cet bed, 


Find & such that 
FP Sa = 3-27, 
Oe a et RP a 


Noting that 1- 4(1—+7,)= 4+ 4, 1-40 —7,,) =4 + 47, we see 
that 
x < 4s, Vv (4s <x<4+ 1p) v v (1+ 4, < x). 


no 30 ny 35m 
In the first case for n > no, and in the third case for n > n, 
y(n) € (x — 27", x +2-*). 


In the second case, let U(2~*, x) 0 [0,1] ¢ U,, U I,,)”- So for that case 
y(n) € (x — 27*, x + 27*). Hence, for n> max(n,, Ny, N>,N;), Yn)E 
(x - 27%, x +27 “ky, oO 


SECOND PROOF (sketch). Let (67), be the Specker sequence of 6.4.4. Take 
any x & [0,1], and let p(x, C) be the distance of x from the Cantor set C, 
which exists by E6.1.9. 

Making use of the fact that for all n © N p(x,C) <n7! or p(x, C) > 
(n + 1)7! we construct a sequence (5,),, in C such that for all n 


p(x,C)>(n+1)* and 6,4,=5,, or p(x,b41) <no7t. 
The b, converge to a limit in C, say b. For this b ImkWn > m(|b — Sn| > 
2-*). If |x — b| < 2~*, we are done; if not, we find |x — b| > 2~*~!, hence 
at some stage of our construction of (b,) p(x,C) > 27*, and then also 
Vn(|x — én] >2-*). O 


Strong Specker sequences can be used to give another proof of theorem 
4.4. 
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At first sight it might appear as if the existence of singular covers 
prevents the development of a good measure theory in CRM, since the 
interval can be covered by a collection of intervals with arbitrarily small 
measure! However, the difficulty disappears if we realize that for singular 
covers (J,,),, the sum 


foo) 
pare 
n=0 
does not converge to a real; if we require this sum to have a limit in R, the 
cover cannot be singular. We shall prove this in 4.10, but we first need a 
lemma. 


4.9. LemMA. Let (I,), be a sequence of intervals contained in [0,1] such that 
for some 5 > 0 


|I,,| converges and is less than1 — 6,5 > 0. 
f) 


Then there is an x not covered by (I,,),, in fact 
ax € [0, 1]wnVy € I,(|x — y| > 277776). (1) 


Proor. Assume the hypotheses of the theorem; without loss of generality 
we may assume the J, to be rational (why?). Let J be a rational segment, 
then it is easy to see that also 


oO 

d | 29 1, converges; 

n=0 
we write ®(/) for the limit. If J =[a, 5], let I’ :=[a,}(a+ 5)], I’ = 
[3(a + b), b]. Then (7) = O(7’) + O(1”) (by a straightforward argument 
about convergent series), and if ®(/) <|J|, then either ®(J’) < |J’| or 
@(1") < |I"|. Therefore, starting from Jy = [0,1], which satisfies ®(J)) < 
|Jo| = 1, we can construct a sequence (J,), such that for all n 


lJ = ay G(J,) < lJ,1 J+) Cc J 


Clearly there is a unique x € [0,1] such that Vn(x © J,,). x is not covered 
by (/,,),; for if x € I, then, since J, is open, for some m J,, C J. But then 
®(J,,) = |1,,, which is impossible. 
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We can now easily refine the argument to prove (1). For suitable e we have 


ioe] 
Y |Z,)<1-8-.«. 


n=0 


Let, for J, = [a, b], I, = [a — 627""7,b + 8 2>-"-7), then 


Yj<1-e. 

n=0 
The preceding argument now applies, with e for 6 and (J’), for ¢J,),3 
hence (J,,),, does not cover x for some x, and (1) holds for this x. O 


4.10. COROLLARY. 
(i) Jf ¢1,), is a singular cover of [0,1] then 


(LIK), 
k=0 


is a Specker sequence. 
(ii) Let (x,,), be a sequence of points in [0,1], then 


ax € [0,1]Vn(|x - x,| > 27773). 
(iii) For any finite sequence (Xo,..., X,) 


ar € ON [0,1]vi < k(|r — x,| > 2-*73). 


PROOF. (i) is immediate. 

(ii) Choose a sequence (J,,), such that |J,| <2~"~*, x, € J, and apply 
4.9. 

(iii) is immediate from (it). O 


As the following theorem, due to Kushner, shows, for Riemann integra- 
bility on an interval it is not sufficient that a function is continuous. This is 
an application of the existence of strong Specker sequences. 


4.11. THEOREM (CT,). There is a function f, continuous on [0,1] which is not 
integrable for any notion of integral ¥ such that 

G) YF has the usual value for polygonal functions; 

(ii) ¥ is monotone, i.e. if f => 0 on [0,1] then ¥(f) = 0; 

(ili) ¥(f) — ¥(g) = #(f — 8). 
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(nr 1) (5,1) 


—_ _— 9-4 
a, =r, — 2 r Ss 


n n 


S, + 2°" 4 = bh 


Fig. 6.5 


PROOF. Let y be a strong Specker sequence such that 
O<yn<y(n+1) <1, 


and let (J,,), be a quasi-cover of [0,1] with bound }; let /, = [r,. s,]. We 
put 


[a,,5,] = [r, —2-"-4 5, + 2-8-4] 


and let F) be the function on [0,1] as indicated in fig. 6.5. 
We write F}(n, x) = F(x); F} is a polygonal function for all n. 


F?(n, x) =1- min 1, x Fk} 
k=0 


we write F? for \x.F?(n, x). F? is again polygonal on [0,1], and the 
slopes are given by rational coefficients. We have 

0< F?(x) <1, 

F?(x) =0 forx€()U--- UL), 

F(x) =1  forx € U{[a;, 6):i<n}. 
Let G(n) be the integral of F on [0,1]; G(n) > | since 


Wipe Sy Gana y ets, 
k=0 k=0 k=0 

Now put 
FX0,x) = (y(0)/G(0)) « F2(0, x), 


F?(n + 1,x) = (y(n + 1) - y(n))/G(n + 1) - F?(n +1, x), 
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and let F? = \x.F3(n, x). F? is polygonal on [0,1], and we observe that 
0 < F3(x) < 4- (0), 
0 < F3(x) s 4(y(n) - y(“- 1)), 
F3(x)=0 forxe€ (1)U--- Ul,) , 
y(0) = §( F3), 
y(n +1)= y(n) = I(F.,). 
Finally, let 
n 
F(n,x) = > F7(i,x), F,=Ax.F(n, x). 
i=0 
Then 
0 < F(n,x) <4-y¥(0)+ VY 4(y(i) — yi - 1)) =4- y(n) 
i=l 
and therefore 
Vx © [0,1]vn(0 < F(n, x) < 4). 


For each x € [0,1] there are 1,, n, such that 


xE€E ([r,> 5] U en Sn, ) 
; _ 3 ots 
with Say = Tay Tay <% < Say Then F°(n, x) = 0 for n > max(n,, n,), The 
function 


max( 1, 22) n 
F(x)= YO F%(i,x)= ¥ F%(i, x) = F(n, x) 
0 


i=0 i= 
is therefore everywhere well-defined, and 
F(x) = lim(F,(x)),- 


F is not integrable, for suppose %(F') = z. On the one hand, F(x) < F(x) 
for all x and for all n EN, so, since %(F,) = y(n) (additivity of %), by 
monotonicity Wn(y(n) < z). y is a strong Specker sequence, hence 


dv € Nan'Vn > n'(y(n) sz -27”). 
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Assume now that there exists an n’ such that 

Vn > n'Wm(y(n + m) — y(n)) < 27°73, 
then for all k 


m+k+t n+k+t 
Pie) 245) 2 GG) 4G =) 
i=n'+1 i=n’+1 


=4(y(n’+k+1)—-y(n’)) < 2777). 
Therefore, for all k 
F(n’ +k+1,x) < F(n’,x) +2777! 
and thus 
F(x) < F(n’,x) +2777). 
However, the integral ¥ is additive and continuous, hence 
z<y(n’) +2777, 
contradicting our assumption. Thus, if F is integrable 
—dn’mn(n =n’ > y(m + n) -— y(n) < 27°73) 
and this conflicts with the boundedness of y. O 
An interesting application of strong Specker sequences is the construc- 
tion by Demuth (1969) of a uniformly continuous function on [0, 1] which is 


everywhere differentiable classically, but in no recursive real has a recursive 
derivative (cf. also Myhill 1971). 


4.12. Positive consequences of ECT, + MP. So far, our results concerned 
“pathologies” occurring in CRM; these could be deduced on the basis of 
CT, alone. On the basis of ECT, + MP we can also prove positive results 
such as “all functions in R — R are continuous”. We need some pre- 
liminaries before we can present a proof of this result. 

Let (r,),, be a fixed standard enumeration of Q, and let us write q, for 
the real determined by a fundamental sequence (7;,.),,)), Satisfying 


Yn (lr eyee) ~ Myra l < 27 (1) 
Then 
Wn(l4x — Mem! < 2"): (2) 
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Let R* denote the set of code numbers x satisfying Vy3uTxyu and (1); 
x & R* can be expressed by an almost negative formula. 
If FER, 0 ER, Walle — rey < 277%, Walt! = ry yuml < 277-7), 


|t — t'|< 27"~?, and z is code for {x} (n)*Am.{ y}(n + m), then z © R* 


and t’ = q, = q,, since |r7.)¢n-1) — Toys ds toajtenl 14 — al + 
Id — "yyoml < 27" 14 2-4-2 4 2-2 = 2-" Hence 


Vie Rax © R*Vi © RVn EN([t— 1']< 2°"? 5 


az ER*(1= 9, At = 4.4 {z }(n) = {x }(n))). 


(3) 
We are now ready to prove the 


THEOREM (ECT, + MP). All f € R — R are continuous. 


Proor. The argument is quite different from the proof of the correspond- 
ing result on the basis of WC-N. Let fe R >R; then Vx,y ER*(q, = 4, 
> f(q,) = f(q,)), and Vx © R*3y © R*(f(q,) = 4,). Since, as noted 
above, x © R* is almost negative, there exists by ECT, a z © N such that 


Vx © R*({z}(x) ERA Gay =f(4x))s (4) 
Vx, ye R*(q, = qy = 9(2\(x) = Vz\)) (5) 
Let us write zo(x, n) for "((z}(x)(ny: Let Tp be the spread of sequences a 
satisfying Vn(|ren — Tacn+iy| < 2 "~*), then there is a recursive @ enumerat- 


ing a set of codes for functions dense in Ta, i.e. the collection {{ On}: 1 © N} 
is dense in Ty. Fix vy € N and let — be partial recursive such that 


{y}(x) if Vn < x-(Tkkn A Un = 0); 
{Om}(x) if u = min,(Tkkn A Un =0) < x 
{(z, y, k)}(x) = Am =min,| {On} © {y }(u) 


A zg v) — 29(6n, ») | > ee 
undefined otherwise. 


We write p(y, k) for p(z, y, k). Let p be partial recursive such that 
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0 if E({z}(~(y.k))) A 
lzo(o(y, k),v) - zo(y, v))| ener 

{v(z,y)}(k)=41 if E({z}(p(y, k))) A 
|zo( oy, k),v) — z9(y, v))| S25 eh 


undefined otherwise. 


We write Wy for Y(z, y). Then 


{ty }(yy) = 0. (6) 


For if not, then Vn—(T(Wy, Py, 2) A Un = 0), hence { p(y, by)} = {y}, 
SO ocy,gyy = 4 and |Zo(P(y, Hy), ¥) — Zo(¥, ¥)I S IZo(PO, Wy), ¥) — 
Iecy,yy)l + lZQ(¥ ¥) — gy < 27°77; but then {yy }(yy) = 0 by definition; 
hence ——={ Py }(Yy) = 0 and thus by Markov’s principle (6) holds. 

Let m be a Gédel-number for A y.min,T(Wy, Py, v) and put M, for 
{m}(y). Note that 


y €R* > U(M,) = {by} (dy) = 0. 


We now show that for any n 
(9m }(M,) = Ty }(M,) |z9(0n,r) ~ 29(y. 2) <2. (7) 


For suppose {@n }(M,) = {y }(M,) and |z9(8n, v) — zo(y, »)) = 2771. 
Then there is also a /east n satisfying these conditions, say n’, and for this 
n’ we have dyiy yy) = on, and therefore |zo(p(y, Py), ¥) — Zo(¥, P= 
2-”*!. This means that {~y}(Wy) = 1, by the definition of ¥; contradic- 
tion, hence (7). 

Similarly for any w, 


{4n }(M,,) = {w }(M,) > |zo(On, 7) — zo(w,v)|< 27-77", (8) 
and therefore, if {y }(,) = {w }(M,), let m’ = max(M,, M,) and 


choose n such that {@n} © {w }(m’); then (7) and (8) both hold for n, and 
thus 


{y }(4,) = Gr }(M,) +299, ») - zo(w, 2) | < 27742, 


and so 


y} (M,) a {w (M,) = 19¢2}(y) = Gi 2}(wy| < 7 tas + 20. (9) 
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From (7) and the observation (3) above we see that 


VkWt Ee R3nVr ER(ft-—t|< 2°" |ft-ft']}<2-*). oO 


REMARK. The theorem may also be seen as a classical result about recur- 
sively given real functions on [0, 1], where “recursively given” means that a 
z can be found such that (4) and (5) hold. The only non-classical element is 
in the use of ECT) to show that all real functions on [0,1] are recursively 
given. 


4.13. Recursive analysis. This approach has a somewhat hybrid character; 
problems with parameters are required to be solved recursively in the 
parameters, but in other respects classical logic is applied. Specifically, a 
sentence 4xAx may be proved by classical means, but for a formula with 
parameters, e.g. JxA(x, y) one requires a proof giving x recursively in y. 

Thus a bisection argument can be used to show that for any specific 
continuous f with {(0) < 0, f(1) > 0 there is indeed a (recursive) real x 
with f(x) = 0; but there is no solution recursively in data for f. The second 
assertion will be obvious from the discussion of the counterexample in 1.2; 
as to the first, we argue as follows. 

Let f map recursive reals to recursive reals, and suppose (0) < 0, f(1) 
> 0. If f(4) = 0, we are done. If (3) # 0, then, by Markov’s principle, 
either f(4) > 0 or f(4) < 0. Suppose f(3) > 0; we test f(4). If this is zero, 
we are done; if not, we continue as before. Thus we either meet a bisection 
point of the form k2~" with f(k2~”") =0, or for all bisection points 
considered f(k2~") #0; in the latter case the recursive sequence of 
bisection points has a recursive real x as a limit for which f(x) =0. O 


5. Notes 


5.1. The contents of section 1 are on the whole standard. The proof of 1.3 
is found in Grayson (1981, 7.2) as well as in Mandelkern (1976). The proof 
of 1.4 is due to J. Mines and F. Richman (in Mandelkern 1982); earlier 
proofs assumed uniform continuity. Theorem 1.5 is exercise 13 in chapter 2 
of Bishop (1967). 

The proof of 2.4 follows Bishop and Bridges (1985). 2.6 and 2.7 corre- 
spond to exercises 14 and 15 in chapter 2 of Bishop (1967). The treatment 
of Riemann integration is standard. 
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5.2. Already Brouwer (1919, 1924) gave standard representations as con- 
tinuous images of Cantor spaces (“finite Mengen”, “fans”) similar to the 
one presented in 3.1 for [0,1]; the particular presentation given here is the 
same as in Heyting (1956, 3.3.3-4). 

3.6 and the first half of 3.8(i) are already stated by Brouwer (1924); since 
Brouwer did not isolate the role of the principle WC-N from FAN and 
FAN*, we do not find in his writings analogues to 3.3, 3.4. 

3.3 and 3.4 for complete separable metric spaces appear as 2.21, 2.23 in 
Troelstra (1966), and also in Troelstra (1967). 

3.5 is a special case of the intuitionistic “Heine—Borel” theorem proved 
by Brouwer (1926A; Heyting 1956, 5.2.2). 

The theorem of E6.3.8—9 is due to Veldman (1981, in the leaflet “Stellin- 


gen”). 


5.3. Singular interval coverings were discovered by Kreisel and Lacombe 
(1957), and independently by Tsejtin and Zaslavskij (1962). 

Theorem 4.2 is found in the latter paper (cf. also our remarks in 5.8.3). 
Singular covers show that in Constructive Recursive Mathematics [0,1] is 
not compact; for Cantor space (the binary tree) this is an immediate 
consequence of Kleene’s recursively well-founded, not well-founded recur- 
sive tree (4.7.6, theorem). 

Julian and Richman (1984) give information on soot of 4.4(i) in the 
literature; in addition they show that in BCM the statement “each uni- 
formly continuous everywhere positive function on [0,1] has a positive 
infimum” is in fact equivalent to the fan theorem. 


5.4. A proof of 4.8 is contained in the proof of theorem 4.3 of Zaslavskij 
(1962). In our presentation we follow Kushner (1973, section 8.1, theo- 
rem 6). 

4.9 and 4.10(i) are taken from Tsejtin and Zaslavskij (1962); our exposi- 
tion follows Kushner (1973, section 8.1, lemma 1). 


5.5. The KLST-theorem. Tsejtin (1959, 1962) proved that in CRM every 
mapping from a complete separable metric space into a separable metric 
space is continuous; the special case of mappings from N“ —> N (E6.4.9) 
was discovered independently by Kreisel et al. (1957, 1959). Thus the 
frequently used name “Kreisel-Lacombe-Shoenfield theorem”, abbrevi- 
ated KLS, is perhaps more appropriately named the “Kreisel-Lacombe-— 
Shoenfield—Tsejtin theorem” (KLST). 
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The proofs use the crea‘.vity of the diagonal set {x : dy7xxy}, avoiding a 
direct appeal to the recursion theorem; our proof in 4.12 for the special, but 
representative case of mappings from R into R is based on the same idea. 

Gandy (1962, for mappings in N“ > N) and Y. Moschovakis (1964, the 
general case) give proofs based on the recursion theorem. A proof for the 
general case of mappings from complete separable metric spaces to sep- 
arable metric spaces based on Moschovakis’ idea is found in the exercises 
E7.2.2—5. Kushner (1973, chapter 9) contains some illuminating comments 
on the method of proof. 

Finally, it is to be noted that as early as 1954 Markov had obtained a 
weaker theorem to the effect that mappings from R to R cannot have 
positive discontinuities (for an exposition see Kushner 1973, section 9.2, 
theorem 6). 


Exercises 


6.1.1. Let us call a function fe X > R (X CR) strictly monotone if Vx,y © X(x < yo 
fx < fy). Show that a strictly monotone and continuous f € [0,1] > R satisfies the inter- 
mediate value theorem in the strong form, i.e. for all cE R (f(0) < ¢ < f(1) > Ax(f(x) = ¢). 


6.1.2. Let us call fe XR, XCR, strongly monotone if Vx,y © X(fx < fy > x <y). 
Show that a strongly monotone and continuous f € [0,1] ~ R is uniformly continuous 
(Mandelkern 1982). 


6.1.3. Let (0, 1]"¢ = {x € R°°:0 < x < 1}. Suppose f € [0,1] > R to be uniformly continu- 
ous, f(0) < 0, f(1) > 0; show that for the canonical extension f* € [0,1] > R® of f we 
have 3x € [0, 1]>&( f*(x) = 0). 


6.1.4. Give an example of an f € [0,1] -~ R which assumes its local suprema (i.e. suprema of 
f on sufficiently small segments) but which cannot be shown to assume its supremum on (0, 1]. 


6.1.5. In the notation of exercise 6.1.3 above, show that for a uniformly continuous f € [0,1] 
> R one has 3x € [0,1]*(/*(x) = sup(f)). 


6.1.6. Show by means of a weak counterexample that (0, 1] U [1,2] is not closed (in particular, 
we cannot show [0,1] U [1,2] = [0,2]). Strengthen this under the assumption of —SEP to a 
refutation of “for all segments J,, J,, if J, Q J, is inhabited, then J, U /, is a segment”. 


6.1.7. Show that [0,1] U [1,2] is a continuous image of [0,1] (therefore we cannot expect to 
prove constructively that the continuous image of a closed set is always closed cf. exercise 
6.1.6). Is compactness (in the sense of “every open cover has a finite subcover”) preserved 
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under continuous mappings? Indicate a property distinguishing [0,1) U [1,2] from [0,1] U 
{l, 2). 


6.1.8. Let X C R be totally bounded. Show that X is located, i.e. a distance function p(x, X) 
for x © R is defined such that Vy © X(p(x, X) <|y — x)) and Vkay © X(p(x, X) < |x — y| 
+274), 


6.1.9. The reals of the form 


ive} 
Y 2e-37" 1, 
n=0 


with a © N > {0,1}, constitute the Cantor discontinuum C as a subset of [0,1] (obtained by 
the well-known procedure of repeated deletion of the open middle third of segments). Show 
that C is located (cf. the preceding exercise). 


6.2.1. Prove 2.2. Hint: for the second part use the first part to conclude that sup] f’| exists on 
[a, 5}. 


6.2.2. Prove 2.3. 
6.2.3. Show that “f is uniformly differentiable with derivative f’” is equivalent to 
JavkWx,y € [a,b](\x-y]<2*AxHy 
If’) ~ (i = f)/(y - 1 < 24). 
6.2.4, Complete the proof of 2.9 and give proofs of 2.10, 2.12-14. 


6.3.1, Let us call f€Q—R_ sequentially continuous, if whenever lim(x,), =x, then 
lim¢ f(x, Yn = f(x). Show on the basis of WC-N that a sequentially continuous f is 
continuous. Hint: consider any r © Q and construct a suitable spread 7’ such that under a 
mapping 9» given by pa = (r,,), (relative to a standard enumeration (r,), of Q) all 
fundamental sequences converging to r within a certain fixed prescribed rate of convergence 
are represented; apply WC-N with respect to this spread (Troelstra 1967). 


6.3.2. X CR is separable iff there is a (x,,), © X such that Vx € XWKAx, (|x — x,| < 27%). 
Generalize the result of the preceding exercise to sequentially continuous f € X > R (Troelstra 
1967). 


6.3.3. Let f ER — R be continuous and sequentially differentiable at x, that is, whenever 
lim(x,,), = x and Vn(x, # x) then lim((f(x,) — f(x))/ (x, — )), exists. Show that f is 
differentiable at x (van Rootselaar 1952). 


_ 63.4. Assume —.4-PEM and show that every f © R — N must be constant. 
Hint: prove first that there is no function defined on a closed subset X C R such that for some 
(4n)n © X, f(a,) = 9 for ali n and f(im(a,),,) = 1. 
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6.3.5. Assume —3-PEM and show that any f © R — R is weakly continuous in the following 
sense: there is no sequence (x,,),,lim(x,),, = x such that Wn(|fx, — fx| = 27*) for some k 
(cf. the preceding exercise). 


6.3.6. Let (/,,), be any sequence of rational intervals of finite length covering R. Show how to 
construct a covering (J,,), of rational open intervals such that 

(i) WaAm(S, C In) (Jy) Fefines (7,,),). 

(ii) Vam(J, CJ, 2 2 =m), 

(iii) Vx € RAmmV n(x € J, 7 n=mVn=m’) (CJ,), has finite overlap). 


6.3.7. Assuming C-N, prove the following version of “Lindeléfs theorem” for R: if 
{W,:1€ X} is any open cover of R, then there is an enumerable subcover {W,,,): 7 © N} 
for some 9 € N > X (Troelstra 1966, 2.2.6; Troelstra 1968, Theorem 8). 


6.3.8. Assume C-N. Show that any f € R — R is continuous with a contirfuous modulus of 
continuity, i.e, for all k there is a continuous g, € R > R* (where R*:= {x: x > 0}) such 
that Vxy((x — yl < g(x) > [f(x) — f(y) < 27“). Hint: use the two preceding exercises to 
construct a cover ((s,.¢,)), With finite overlap for R, s,,¢, € @, such that 
Vx,¥ E (5p. ty) (f(x) — £0) < 274). Define 


8, (x) = sup{ [min(max(x — s,,0),max(t, — x,0))]): EN}. 
6.3.9. Refine the result of the preceding exercise to obtain: there exists a continuous g(y, x) 
with y > 0 > g(y,x) > 0,» <0 g(y, x) = 0, and 


VxyWz > O(|x — yl < g(z, x) > Ife - fri < z). 


6.3.10. Let 
00 90 od 
S = > Xn» S, = y Xa(n)> S* = > [xn 
n=@Q n=0 n=0 


where o is a permutation of N, ie. a bijection from N to N. S converges absolutely iff S* 
converges. Show that (i) If S converges absolutely, then so does S,. We say that S diverges iff 


= 


Show that (it) If S converges and S* diverges, then S, can take any value a in R* = RU 
{—00, +00} for a suitably chosen permutation o depending only on a. 


ke 
L * 
n=0 


Vm3kVk' > ( 


6.3.11. Let S, S, be as in the preceding exercise. Assuming WC-—N, show that if S, converges 
for all permutations o, then S is absolutely convergent. Hint: first prove the theorem under 
the assumption Vn(x, © @); construct a spread representing all permutations of N > N (W. 
Gielen; a proof is in Troelstra 1977, 6.11.) 
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6.4.1. Verify in detail the inequality 


lf) -fOIs L214) -460)1 


n=0 


in the proof of 4.4. 
6.4.2. Show in 4.5: if no finite part of (/,,), covers, then Z = N. 
6.4.3. Prove 4.6(ii). 


6.4.4. Define 
3(n) = Yo 2a, (k) 3 
k=0 


where a, is the characteristic function of Ju < n(Txxu A Uu=0), ie. a,x =le dus 
n(Txxu A Uu = 0). Show that (8(n)), is a Specker sequence, and that for the Cantor 
discontinuum C we have 


Vy © Caknvk’ > k((8(k’) — y| = 27") 
(compare the proof of 4.7.6, theorem). 
6.4.5. Construct a strictly monotone strong Specker sequence. 
6.4.6. Construct a Specker sequence assuming the existence of a lawless 01-sequence (cf. 4.6.2). 
6.4.7. Show, using strong Specker sequences instead of singular coverings, how to construct 
(i) an f € [0,1] > R which is continuous and unbounded; 
(ii) an f € (0,1] — [0,1] which is continuous but not uniformly continuous; 
(iii) an f © [0,1] > [0,1] with inf( f) = 0, Vx € [0,1]( f(x) > 0). 
6.4.8. Prove the assertion of exercise 6.3.11 on the assumption of ECT, + MP (Beeson 1976). 
6.4.9. Give a simplified version of theorem 4.12 establishing the special case: all functions 


from NN to N are continuous, where N™. has the usual product topology and N the discrete 
topology (cf. 4.1.5, 5.3.15). 
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In this index we have included notions and terminology of more than 
strictly local significance. The page numbers following an entry refer to the 
places where a definition, modification or extension of an earlier definition, 


or a notational convention concerning the entry is to be found. 


absolute difference (arithmetic), 116 
abstraction operator, 144 
absurdity rule, classical, 46 

— -, intuitionistic, 40 
accumulation point, 265 

Aczel slash, 139 

algorithm, 131 

almost negative, 197 

analysis, elementary, 144 

-, recursive, 320 

apartness, 255, 280 

application operator, 144 

—, continuous function, 157 
arithmetic (of reals), 206, 277 
arithmetic, Heyting, 126 

—, ~ (in higher types), 170 

—, ~ (second-order), 164 

—, intuitionistic first-order, 126 
-, — higher-order, 170 

—, — second-order, 164 

—, primitive recursive, 120 
arithmetical comprehension, 167 
assumption, 39 

~, cancelled, 39 

-, eliminated, 40 

-, open, 39 

axiom of choice, quantifier-free, 144 
~- — countable choice, 189 

— — dependent choice, 191 


— ~ extensionality (higher-order logic), 169 


- ~ partial choice, 244 
- ~ relativized dependent choice, 191 


-, comprehension (higher-order logic), 169, 


171 
-, — (second-order logic), 162 
-, fan, 217 
-, presentation, 190 
-, product (higher-order logic), 171 
-, projection (higher-order logic), 171 


Baire space, 187 
ball, open, 265 

bar, 224, 233 

- continuity, 228 

— induction, 229 

-— ~, decidable, 229 
- ~, double, 249 

— =, monotone, 229 
—- theorem, 227, 233 
basic sort, 169 
basis, 187 

BCM, 28 
BHK-interpretation, 10 


Bishop’s constructive mathematics, 3, 28 


Bolzano-Weierstrass theorem, 268 
bound variable, xv 

—, greatest lower, 294 

—, least upper, 294 
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bounded (set of cuts), 273 — (for Kripke models), 89 

— extended real, 270 composition (closure under), 115 
— minimum operator, 117 comprehension axiom, 188 

~ sequence, 267 ~ —, (higher-order logic), 169, 171 
-, totally, 294 ~ -, (second-order logic), 162 

-, weakly (cut), 270 ~ term, 163 

-, ~ (set of cuts), 273 ~, arithmetical, 167 

branch, 187 -, full 162, 169, 171 

Brouwer’s continuity theorem, 307 ~, function, 188 
Brouwer—Heyting-Kolmogorov ~, impredicative, 162, 169, 171 


interpretation, 10 concatenation, 119, 186 


connectedness of the interval, 293 


cancelled assumption, 39 conservative extension, xviii, 93 
canonical model construction, 88 consistent (pair), 86 

Cantor space, 187 constructive mathematics, Bishop's, 3, 28 
cartesian product, xvii - -, recursive, 3, 25 

Cauchy completeness, 267 -, formula, 60 

— modulus, 253 —, negative, 61 

— real, 253 —, positive, 61 

~ sequence, 266 -, Strictly positive, 66 

CDC, 41 continuity, strong, 212 
characteristic function, xvii ~ for disjunctions, weak, 209 
choice sequence, 206 - theorem, Brouwer’s, 307 
choice (see “axiom of”) — with uniqueness, weak, 211 
Church’s mile, 243 -, bar, 228 

— ~, extended, 243 -, function, 216 

— thesis, 192 -, local, 209 

~ — for disjunctions, 193 -, special, 215 

— ~, arithmetical form of, 193 -, weak, 209 

— ~, extended, 199 continuous, 188 

— -, partial, 244 ~ function application, 157 
classical, 1 ~, locally uniformly, 260, 277 
- absurdity rule, 46 -, sequentially, 211 

— logic, 46 -, uniformly, 260, 266, 277 

— predicate logic, 48 convention, crude discharge, 41 
— propositional logic, 48 convergence, 266 

— real, 270 —, Negative, 288 

— reals, characterization of, 275 conversion, lambda-, 144 
closed, 63 correct, intuitionistically, 75 
closure (of a pointset), 265 countable choice, axiom of, 189 
coincidence (of fundamental sequences), 253 counterexample, weak, 8 
compactness, 218 course-of-values function, 119 
— (of the interval), 305 ~ recursion, 119 

complement, xvii cover, 307 

complete(ness), Cauchy, 267 ~, epsilon-bounded, 307 


-, order-, 273 —, interval, 307 


-, quasi-, 311 

~, segmental, 307 

—, Singular, 307 

-, universal, 307 
creative subject, 236 
CRM, 25 

crude discharge convention, 41 
cut, 270 

-, Dedekind, 288 

—, left, 270 

-, weak, 270 

-, weak left, 270 

-, weakly bounded, 270 
cut-off subtraction, 116 


d-formula, 64 

decoding function, 119 
Dedekind cut, 288 

~ real, 270 

- -, characterization of, 275 

— -, strong, 274 

deduction (Hilbert-type), 68 

- (natural), 39 

~ from axioms, 47 

~, natural, 37 

definability, explicit, 175 
definition by cases, 153, 273 

- symbol, xv 

definitional extension, xix, 93 
dependence (on assumptions), 41 
dependent choice, 191 
derivative, 295 

description axiom, 55 

— operator, 55 

descriptor, 55 

differentiable, 295 

-, sequentially, 323 

-, uniformly, 296 

discharge convention, crude, 41 
discharged assumption, 40 
discrete sets, subcountability of, 241 
— topology, 266 

disjunction property, 139 
domain function (for Kripke models), 80 
double induction, 124 

— negation shift, 106 


Index 


E-logic, 50, 52 

E*-logic, 51, 52 

elementary analysis, 144 
eliminated assumption, 40 
elimination rule, 40 
enumerable, recursively, 134 
epsilon-bounded cover, 307 
equality (E-, E*-logic), 52 

— (logic with), 48 

-, stable, 256 

-, theory of, 85 

equivalence class, xvii 

E-rule, 40 

essentially existence-free, 197 

— isolating, 63 

excluded middle, principle of the, 10 
existence predicate, 50 
existence-free, essentially, 197 
explicit definability for numbers, 139 
- — for sets, 182 
exponentiation (arithmetic), 116 
extended Church’s rule, 243 

— — thesis, 199 

— real, 270 

— real, bounded, 270 

— reals, characterization of, 275 
extension, conservative, xviii, 93 
-, definitional, xix, 93 
extensional, 191 
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extensionality axiom (higher-order logic), 169 


external(ly valid), 78 


fan, 186 

- axiom, 217 

— — (for the binary tree), 219 
— theorem, 217 

filtration method, 91 

finitary spread, 186 

finite, 14 

— sequence (coding of a), 118 
finitely indexed, 14 

finitism, 2, 25 

forcing (Kripke), 77, 81 
formula context, 60 

~ occurrence, 59 

-, almost negative, 197 


IV Index 


-, geometric, 110 Hilbert’s programme, 25 

-, negative, 57 Hilbert-type system, 68 

-, one-variable, 49 homeomorphism, 188 

~, positive, 110 hypothetical reasoning, 30 

—, Rasiowa—Harrop, 143 

fragment, negative, 57 idealized mathematician, 236 

free variable, xv idempotency of realizability, 198 

full comprehension (higher-order logic), 169, identity (literal), xv 

171 impredicative comprehension (higher-order 

~ — (second-order logic), 162 logic), 169, 171 

function, 191 — — (second-order logic), 162 

— application, continuous, 157 inclusion, xvii 

— comprehension, 188 independence of premiss rule, 138 

- continuity, 216 index, 154 

-, characteristic, xvii induction, 114 

-, concatenation, 119 — axiom schema, 126 

-, course-of-values, 119 ~ over unsecured sequences, 230 

-, decoding, 119 — rule, 120, 126 

-, inverse signum, 116 -, bar, 229 

-, neighbourhood, 211 -, double, 124 

-, partial recursive, 132 -. special, 121 

-, primitive recursive, 115 -, transfinite, 129 

-, recursive, 132 inductive definition, 145 

-, result-extracting, 133 - -, generalized, 231 

-, Signum, 116 — tree, 225 

-, Skolem, 85 infimum, 294 

-, total recursive, 131 infinite set, 288 

—, transition, 85 inhabited, 16 

-, zero-, 115 inseparable, recursively, 134 

functional type, 173 integer, 251 

fundamental sequence, 253 integration, Riemann, 300 
intensional, 191 

gl.b, 294 interior, 188, 266 

generalized inductive definition, 231 intermediate-value theorem, 292, 293 

— type, 172 internally valid), 78 

geometric formula, 110 interval cover, 307 

Glivenko’s theorem, 106 -, closed, 291 

Gédel-Gentzen translation, 57, 128 (see also -, connectedness of the, 293 

“negative translation”) -, half-open, 292 

greatest lower bound, 294 ~, open, 291 
introduction rule, 40 

hesitant sequence, 208 intuitionism, 4, 20 

Heyting arithmetic, 126 intuitionistic absurdity rule, 40 

~ — in higher types, 170 — first-order arithmetic, 126 

— -, second-order, 164 — higher-order logic, 170 


higher-order logic, intuitionistic, 170 — predicate logic, 48 


— logic (with E), 52 
propositional logic, 48 
second-order arithmetic, 164 
second-order logic, 161 
intuitionistically correct, 75 

- valid, 75 

isolating, 63 

-, essentially, 63 

isotone, strongly, 110 
iteration (of a function), 117 
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K-valid, 78, 81 

Kleene brackets, 133 

~ slash, 179 

Kleene’s T-predicate, 133, 155, 156 

KLST-theorem, 321 

Kolmogorov’s translation, 59 

KGnig’s lemma, 8 

Kreisel-Lacombe-Shoenfield-Tsejtin 
theorem, 321 

Kripke model, 77, 80, 84, 85 

— — (E*-logic), 86 

~ -—, normal, 109 

- -, propositional, 77 

- -, truncated, 78 

~ semantics, 75 

Kripke valid, 78, 81 

Kripke’s schema, 236 

~ —, weak, 214 

Kuroda’s translation, 59 


lu.b, 294 
lambda-conversion, 144 
lattice, Rieger-Nishimura, 50 
lawless sequence, 207 
lawlike, 192 

least number principle, 129 
— upper bound, 294 

length, 119 

limit, 266 

Lindeléfs theorem, 216, 324 
local continuity, 209 
locatedness, 270 

logic with existence predicate, 50 
~, classical, 46 

~, — predicate, 48 
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—, — propositional, 48 

—, intuitionistic higher-order, 170 
-, — predicate, 48 

-, - - (with E), 52 

-, — propositional, 48 

—, — second-order, 161 

—, minimal predicate, 57 

logical operation, 9 


Markov’s principle, 203 

— rule, 129, 136 

maximum (arithmetic), 116 

—, existence of, 295 

metric space, 265 

minimal predicate logic, 57 
minimalization, 131, 135 
minimum (arithmetic), 116 

— operator, 132 

- -, bounded, 117 

model construction, canonical, 88 
-, Kripke, 77, 80, 84, 85 

-, — (E*-logic), 86 

-, normal Kripke, 109 

modulus (of a Cauchy sequence), 266 
- of convergence, 266 

-, Cauchy, 253 

monotone inductive clause, 147 

— sequence, 267 

— -, decreasing, 267 

— -, increasing, 267 

— -, non-decreasing, 267 

— -, non-increasing, 267 

—, strictly (function), 322 

—, — (sequence), 267 

-, Strongly (function), 322 
monotonicity (of Kripke models), 78 
-, Strong (of cuts), 270 


natural deduction, 37, 39 
negative context, 61 

— formula, 57 

— fragment, 57 

— occurrence, 61 

— translation, 57, 59, 128 

— — (HAS), 165 

— — (higher-order logic), 170 


VI 


—, almost, 197 

negatively convergent, 288 

neighbourhood function, 211 

— function principle, 215 

~— —, generating of a, 225 

node (of a Kripke model), 77 

non-empty, 16 

non-extensional, 191 

non-oscillating, 288 

normal form theorem (recursion theory), 155, 
156 

- Kripke model, 109 

numeral, xviii 


occurrence, formula, 59 

—, negative, 61 

-, positive, 61 

-, strictly positive, 66 

-, subformula, 59 
omniscience, principle of, 194 
-, — — limited, 194 

open (pointsets), 188, 266 

— assumption, 39 

— ball, 265 

operation, 191 

—, logical, 9 
order-complete(ness), 273 
overlap, finite (of a cover), 307 


p-morphism, 110 

pairing (arithmetic), 118 
parameter (Kripke models), 80 
parametrization principle, 235 
partial choice, axiom of, 244 
- Church’s thesis, 244 

— recursive function, 132 
point (of a metric space), 265 
— (of a topological space), 188 
positive context, 61 

~- formula, 110 

— occurrence, 61 

Post’s theorem, 205 

power type, 169 

powerset, xvii 

predecessor function, 116 

-— -, classical, 48 


Index 


— —, intuitionistic, 48 

— -, -— (with E), 52 

— —, language of, 36 

— —, minimal, 57 

predicative, 2 

predicativism, 2, 24 

prenex formula, 104 
presentation axiom, 190 

prime number, 117 

primitive recursive arithmetic, 120 
~ — function, 115 

~ — relation, 117 

— type, 169 

principle of omniscience, 194 
— — -, limited, 194 

—, least number, 129 

-, Markov’s, 203 

-, parametrization, 235 

—, separation (SEP), 195 

—, Shanin’s, 245 

product axiom (higher-order logic), 171 
- type, 169 

~, cartesian, xvii 

programme, 133 

projection (arithmetic), 115 

— axiom (higher-order logic), 171 
~ function, 115 

propositional logic, classical, 48 
— ~, intuitionistic, 48 

- —, language of, 37 

- variable, 37 


quasi-cover, 311 


re., 134 

Rasiowa-Harrop formula, 143 
rational, 251 

real, bounded extended, 270 
-, bounded singleton, 281 
~, Cauchy, 253 

-, classical, 270 

-, Dedekind, 270 

—, extended, 270 

-, singleton, 281 

-, strong Dedekind, 274 
realizability, 195 


Index VII 


-, characterization of, 199 segmental cover, 307 
reasoning, hypothetical, 30 semantics, Kripke,-75 
recursion theorem, 134, 154 semi-intuitionist, 18 

— — (for function application), 159 separation principle (SEP), 195 
~, closure under, 115 sequence, Cauchy, 266 

~, course-of-values, 119 -, choice, 206 

-, definition by, 115 -, finite (coding of), 118 

-, simultaneous, 178 —, fundamental, 253 
recursive analysis, 320 —, hesitant, 208 

— function, 132 -, lawless, 207 

- -, partial, 132 —, Specker, 268 

— -, total, 131 -, strong Specker, 311 

— mathematics, constructive, 3, 25 sequentially continuous, 211 
recursively enumerable, 134 - differentiable, 323 

- inseparable, 134 set-existence property, 182 
recursor axioms, 144 sfo, 59 

reflection, 153 Shanin’s principle, 245 
reflexivity, 48 signum function, 116 
relation, primitive recursive, 117 — —, inverse, 116 

relativized dependent choice, axiom of, 191 simultaneous recursion, 178 
replacement (rule, schema), 48, 52 — substitution, xvi 
restriction, xvii singleton real, 281 
restrictions, Kripke model with, 85 — -, bounded, 281 
result-extracting function, 133 singular cover, 307 
Rieger-Nishimura lattice, 50 Skolem function, 85 
Riemann integration, 300 slash, Aczel, 139 

Rolle’s theorem, 296 -, Kleene, 179 

root (of a Kripke model), 90 smn-theorem, 133, 153, 154 
rule, Church’s, 243 — (for function application), 158 
-, elimination, 40 sort, basic, 169 

-, extended Church’s, 243 soundness (for Kripke models), 82, 84 
-, independence of premiss, 138 ~ (for realizability), 199 

—, induction, 120, 126 space, metric, 265 

-, introduction, 40 —, topological, 188 

-, Markov’s, 129, 136 special continuity, 215 

-, substitution, 51 ‘— induction, 121 

-, — (arithmetic), 120 Specker sequence, 268 


— -, strong, 311 


S.p.p., 66 spread, 186 

saturated, 87 -, finitary, 186 

saturation lemma, 88 —, universal, 187 

schema, Kripke’s, 236 spread-representation, 303 
second-order arithmetic, intuitionistic, 164 spreading, 63 

— Heyting arithmetic, 164 stable equality, 256 

- logic, intuitionistic, 161 strictly monotone sequence, 267 


segment, 291 — positive context, 66 
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— — occurrence, 66 

— — part, 66 

— — subformula, 66 

strictness, 52 

— (of esti-relation), 171 

strong completeness for Kripke models, 89 
- continuity (for numbers), 212 
- monotonicity of cuts, 270 

~- Specker sequence, 268 
strongly isotone, 110 

~ monotone function, 322 
subcountability of discrete sets, 241 
subfinite, 32 

subformula occurrence, 59 

—, Strictly positive, 66 
substitution rule, 51 

— — (arithmetic), 120 

-, simultaneous, xvi 
subtraction, cut-off, 116 
subtree, 187 

supremum, 294 

surjection, xvii 


T-predicate (Kleene’s), 133, 155, 156 
Taylor series, 297 

tertium non datur, 10 

theorem, bar, 227, 233 

-, fan, 217 

-, Glivenko’s, 106 

—, intermediate-value, 292, 293 

-, normal form (recursion theory), 155, 156 
-, Post’s, 205 

—, recursion, 134, 154 

-, — (function application), 159 

-, Rolle’s, 296 

-, smn-, 133, 153, 154 

~, smn- (for function application), 158, 82 
~, soundess (Kripke models), 82, 84 
~, - (realizability), 199 

topological space, 188 

topology, 187 

— (metric space), 265 

~, discrete, 266 

total recursive functions, 131 

totally bounded, 294 
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transfinite induction, 129 

transition function, 85 

transitivity rule, 48 

translation, Gddel-Gentzen, 57, 128 (see also 
“negative translation”) 

-, Kolmogorov’s, 59 

-, Kuroda’s, 59 

—, negative, 57, 59, 128 

-, — (higher-order logic), 170 

-, — (for HAS), 165 

tree, 186 

-, binary, 187 

-, inductive, 225 

—, universal, 187 

—, well-founded, 224 

true (at a node in a Kripke model), 77, 81 

truncated Kripke model, 78 

tuple, xviii 

— (coding of a), 118 

type, 169 

- theory, 174 

-, functional, 173 

-, generalized, 172 

-, power, 169 

-, primitive, 169 

-, product, 169 

-, variable, 174 


ultra-finitism, 6, 29 

uniformity principle (for functions), 235 
— — (for numbers), 234 

uniformly continuous, 260, 266, 277 

- -, locally, 260, 277 

— differentiable, 296 

universal cover, 307 

— spread, 187 

— tree, 187 

unsecured sequences, induction over, 230 


valid, xix 

— (at a node in a Kripke model), 78, 81 
-, intuitionistically, 75 

-, Kripke, 78, 81, 

variable type, 174 

-, propositional, 37 


w.l.c., 270 

weak counterexample, 8 

weakly bounded cut, 270 
weakly bounded set of cuts, 273 
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LIST OF SYMBOLS 


Part I lists the abbreviations designating formal systems, part II the 
abbreviations for axioms or axiom schemas. The listing is alphabetical, 
disregarding mathematical symbols which are not roman letters. Part III 
lists other notations in order of appearance. Notations which are used only 
locally (‘ad hoc’) are not listed as a rule. 


I. Formal systems II. Axioms and rules 

CPC 48 AC 189 
Cac 48 AC! 99 
EL 144 AC, 189 
HA 126 AC oo 189 
HAH 170 ACy! 188 
HAS 164 ACoy 189 
HAS, 167 AC-DD’ 190 
H-IQC etc. 68 AC-N 189 
H-IQCE etc. 73 AC-NF 189 
IPC 48 AC-NN 189 
IQC 48 AC-NN! 194 
1QC? 160 AP}, 2, 3 256 
IQCE 52 BC-N 228 
IQCE* 52 BI, 229 
MQC 57 Bly, 229 
N-CQC etc. 49 as 46 
N-IQC etc. 49 CA 161, 169 
PA 128 C-C 217 
PRA 120 C-D 214 
TYP 169 C-N 212, 214 
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AI, VI, - I, VI, dE 
VIF, aIF 


List of Symbols 


SHP 


215 
195, 210 
245 
52, 73 
51, 120 
129 
48 
234 
235 
235 
209 
211 
216 
209 
241 


List of Symbols XIII 
III. Other notations 
n,m are used for natural numbers, a for a function from natural 


numbers to natural numbers, A, A’, A” for formulas, X, Y for sets, ¢ for a 
term, a for arbitrary expressions. 


= xv (Xigiscvy Me) XViil 
= xv f= XViii 
FV(a), BV(a) Xvi NXes Net XVili 
Of Xpeee pX/bysceyty)s NX, X 7,026, Xp_el XVili 

a[x/i] xvi graph( f), dom( f ) 
A,V,27,+4,V, range( f ) XViii 

q,-, 0,3! xvi, 9 N,Z,Q,B,R,C xviii 
>, oe Xvi n,m,i, j,k xviii 
AeA oe A” XV1 a, B, y, 6 xviii 
A,V xvi n,m XViii 
Vx © X,4x eX xvi (a,)n XViil 
VXp%o056.2X gs &(H) XViii 

AX |xXy...X, XVii b xix 
Vx,y €A,4x,y EA Xvii E 13 

g XVil A,(P) 49 
E,¢ xvii = 52, 54 
c,2> XVii Ix.A 55 
A, U,N,U,\ XVii Fe Fin 57 
(Recouead XVii Ag 57 
te: A}, (f(x): A} XVii A*, Ad 59 

xvii CON 60 

oe Y,(a, a’) xvii POS, NEG 61 
Il xvii SPOS 66 
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